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Preface 


The concept of attitude has played a major role in the history of social psychology. 
From its relatively simple beginning as a state of preparedness or a set to make a par- 
ticular overt response, the concept has grown into its present-day formulation as a 
complex, multidimensional concept consisting of affective, cognitive, and conative com- 
ponents. As Gordon Allport pointed out more than thirty years ago, “,. . attitude is 
probably the most distinctive and indispensable concept in contemporary American 
social psychology. No other term appears more frequently in experimental and the- 
oretical literature.” Despite the enormous growth of social psychology, and the diversity 
of interest of contemporary social psychologists, Allport's words are as true today as 
they were in 1935. In addition, the attitude concept has come to play an increasingly 
important part in almost all of the behavioral sciences and many of the applied dis- 
ciplines. 

Because of the enormity of the attitude literature, and the proliferation of sources 
in which it appears, the study of attitudes has become difficult for all but the most 
dedicated students. Perhaps even more difficult, however, are the problems that con- 
front the teacher attempting to prepare a course in the attitude area. Not only is he 
likely to be overwhelmed by the sheer quantity and diversity of the literature, but he 
will soon be appalled by its relative inaccessibility to his students. It was primarily the 
latter problem that led to the preparation of this book. í " 

The initial plan was to bring together a collection of articles that would represent 
the three major facets of the study of attitude, that is, theory, measurement, and change. 
It soon became obvious, however, that no single volume could adequately sample all of 
the attitude literature. Since it has always been my opinion that a complete under- 
standing and analysis of attitude change is impossible without first having a thorough 
knowledge of attitude theory and measurement, I decided to restrict the book to these 
latter two topics. Even though this restriction greatly reduced the number of articles 
that were considered, many excellent articles discussing attitude theory and measure- 
ment have also had to be excluded. 

The selection of articles has been guided by several criteria. First, and foremost, I 
have deliberately emphasized. those articles that have already demonstrated their lasting 
significance to an understanding of the attitude concept. Every attempt has been made 
to present the original contributions of those authors whose names repeatedly occur in 
any review of the attitude literature. 

Second, although approximately half the book is concerned with attitude measurement, 
an attempt was made to select only those articles that are primarily concerned with the 
theory underlying attitude measurement rather than with the measurement process per 
se. It should be noted that in order to make the book maximally useful to all students, 
many commendable articles that involve extremely complex mathematical analyses have 
been omitted. 

Third, an attempt was made to sample as wide a variety of historical and contempo- 
rary approaches to attitude theory and measurement as was possible. 

Clearly, even within the limits established by these criteria, I have undoubtedly in- 
cluded articles that some instructors may find of little value, and have omitted others 
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that some of my colleagues will regard as indispensable. It is unfortunate that at least 
three of these omissions (the contributions of Leon Festinger, Bert Green, and Eckard 
Hess) have been necessitated by copyright laws. Although I am indebted to many col- 
leagues for recommendations and advice that guided the selection of articles, the re- 
sponsibility for the remainder of the omissions and commissions is solely my own. 

I am also indebted to the many authors (and publishers) who did grant permission to 
reprint their articles and, in particular, to those who graciously allowed me to edit their 
articles. I hope that my modification has in no way reduced the significance or meaning- 
fulness of their contributions. Similarly, I hope my use of editorial license to include 
some of my own views has not detracted from the quality of the remainder of the selec- 
tions. 

I thank also my colleagues and students at the University of Illinois who have con- 
tinually provided support and encouragement. In addition, I must thank two very special 
women: my secretary, Marlys Brown, who assisted me throughout every phase of the 
project and who capably handled almost all of the administrative details; and my wife, 
Deborah, who, in her usual competent manner, managed to bear with me while I was 
working on this book. 


Martin Fishbein 


Urbana, Illinois 
April 1967 
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READINGS IN 


Attitude Theory and Measurement 


PART I 


Historical Foundations 


1. Attitudes 


GORDON W. ALLPORT 


The concept of attitude is probably the most 
distinctive and indispensable concept in con- 
temporary American social psychology. No 
other term appears more frequently in ex- 
perimental and theoretical literature. Its popu- 
larity is not difficult to explain. It has come into 
favor, first of all, because it is not the property 
of any one psychological school of thought, and 
therefore serves admirably the purposes of eclec- 
tic writers. Furthermore, it is a concept which 
escapes the ancient controversy concerning the 
relative influence of heredity and environment. 
Since an attitude may combine both instinct 
and habit in any proportion, it avoids the ex- 
treme commitments of both the instinct-theory 
and environmentalism. The term likewise is 
elastic enough to apply either to the disposi- 
tions of single, isolated individuals or to broad 


patterns of culture. Psychologists and sociolo- ` 


gists therefore find in it a meeting point for 
discussion and research. This useful, one might 
almost say peaceful, concept has been so widely 
adopted that it has virtually established itself 
as the keystone in the edifice of American social 

psychology. In fact several writers (cf. Bogardus, 
: 1931; Thomas and Znaniecki, 1918; Folsom, 
1931) define social psychology as the scientific 
study of attitudes. 

As might be expected of so abstract and ser- 
viceable a term, it has come to signify many 
things to many writers, with the inevitable re- 
sult that its meaning is somewhat indefinite and 
its scientific status called into question. Among 
the critics (e.g, Bain, 1927-1928; McDougall, 
1933; Symonds, 1927), McDougall has been the 
most severe: 


American social psychologists and sociologists 


* Excerpted from an article in C. Murchison (Ed.), 
Handbook of Social Psychology, Clark University 
Press, Worcester, Mass., 1935, with permission of 
the author and the publisher. 


r 


have recently produced a voluminous literature 
concerning what they call “social attitudes”; 
the term is used to cover a multitude of facts 
of many kinds including almost every variety 
of opinion and belief and all the abstract quali- 
ties of personality, such as courage, obstinacy, 
generosity and humility, as well as the units of 
affective organization which are here called 
“sentiments.” I cannot see how progress in so- 
cial psychology can be made without a more 
discriminating terminology (1933, p. 219). 


It is undeniable that the concept of “atti- 
tude” has become something of a factotum for 
both psychologists and sociologists. But, in spite 
of all the animadversions of critics, the term is 
now in nearly universal use and plays a central 
rôle in most of the recent systematic studies in 
social psychology. It is therefore a concept 
which students must examine with unusual 
care. 


HISTORY OF THE CONCEPT OF ATTITUDE 


Like most abstract terms in the English lan- 
guage, attitude has more than one meaning. 
Derived from the Latin aptus, it has on the one 
hand the significance of “fitness” or “adapted- 
ness,” and like its by-form aptitude connotes a 
subjective or mental state of preparation for 
action. Through its use in the field of art, how- 
ever, the term came to have a quite indepen- 
dent meaning; it referred to the outward or 
visible posture (the bodily position) of a figure 
in statuary or painting. The first meaning is 
clearly preserved in modern psychology in what 
are often referred to as “mental attitudes”; and 
the second meaning in “motor attitudes.” Since 
mentalistic psychology historically precedes re- 
sponse psychology, it is only natural to find 
that mental attitudes are given recognition ear- 
lier than motor attitudes. One of the earliest 
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psychologists to employ the term was Herbert 
Spencer. In his First Principles (1862) he wrote: 


Arriving at correct judgments on disputed ques- 
tions, much depends on the attitude of mind 
we preserve while listening to, or taking part, 
in the controversy: and for the preservation of 
a right attitude it is needful that we should 
learn how true, and yet how untrue, are aver- 
age human beliefs (Vol. 1, pp. 1, i). 


Similarly in 1868 Alexander Bain wrote: 


The forces of the mind may have got into a set 
track or attitude, opposing a certain resistance 
as when some one subject engrosses our atten- 
tion, so that even during a break in the actual 
current of the thoughts, other subjects are not 
entertained (p. 158). 


Somewhat later, when psychologists were for- 
saking their exclusively mentalistic point of 
view, the concept of motor attitudes became 
popular. In 1888, for example, N. Lange de- 
veloped a motor theory wherein the process of 
a perception was considered to be in large part 
à consequence of muscular preparation or “set.” 
At about the same time Miinsterberg (1889) de- 
veloped his action theory of attention, and Féré 
(1890) maintained that a balanced condition of 
tension in the muscles was a determining condi- 
tion of selective consciousness. In 1895 Baldwin 
proposed motor attitudes as the basis for an 
understanding of emotional expression, and 
later writers, such as Giddings (1896) and Mead 
(1924-1925) expanded still further the réle of 
motor attitudes in social understanding. 

In recent years it is uncommon to find ex- 
plicit labeling of an attitude as either “mental” 
or “motor.” Such a practice smacks of body- 
mind dualism, and is therefore distasteful to 
contemporary psychologists. In nearly all cases 
today the term appears without a qualifying ad- 
jective, and implicitly retains both its original 
meanings: a mental aptness and a motor set. 
Attitude connotes a neuropsychic state of readi- 
ness for mental and physical activity. 


Attitudes in Experimental Psychology 


Perhaps the first explicit recognition of atti- 
tudes within the domain of laboratory psychol- 
ogy was in connection with a study of reaction- 
time. In 1888 L. Lange discovered that a sub- 
ject who was consciously prepared to press a 
telegraph key immediately upon receiving a 


signal reacted more quickly than did one whose 
attention was directed mainly to the incoming 
stimulus, and whose consciousness was not 
therefore directed primarily upon the expected 


.reaction. After Lange's work, the task-attitude, 


or Aufgabe, as it came to be called, was discov- 
ered to play a decisive part in nearly all psycho- 
logical experiments. Not only in the reaction 
experiment, but in investigations of perception, 
recall, judgment, thought, and volition, the cen- 
tral importance of the subjects preparedness 
became universally recognized. In Germany, 
where most of the experimental work was done, 
there arose a swarm of technical expressions to 
designate the varieties of mental and motor 
"sets" which influence the subjects' trains of 
thought or behavior during the experiment. In 
addition to the Aufgabe, there was the Absicht 
(conscious purpose), the Zielvorstellung (or idea 
of the goal), the Bezugsvorstellung (idea of the 
relation between the self and the object to 
which the self is responding), the Richtungsvor- 
stellung (or idea of direction), the determindie- 
rende Tendenz (any disposition which brings in 
its train the spontaneous appearance of a de- 
termined idea), the Einstellung, a more general 
term (roughly equivalent to "set"), the Haltung 
(with a more behavioral connotation), and the 
Bewusstseinslage (the "posture or lay of con- 
sciousness"). It was perhaps the lack of a gen- 
eral term equivalent to "attitude" that led the 
German experimentalists to discover so many 
types and forms. 

Then came the lively controversy over the 
place of attitudes in consciousness. The Wiirz- 
burg school was agreed that attitudes were nei- 
ther sensation, nor imagery, nor affection, nor 
any combination of these states. Time and 
again they were studied by the method of intro- 
spection, always with meager results. Often an 
attitude seemed to have no representation in 
consciousness other than a vague sense of need, 
or some indefinite and unanalyzable feeling of 
doubt, assent, conviction, effort, or familiarity. 
(Cf. Fearing, 1931; Titchener, 1909.) 

As a result of the Würzburg work all psychol- 
ogists came to accept attitudes, but not all be- 
lieved them to be impalpable and irreducible 
mental elements. Marbe's conception of the Be- 
wusstseinslage as an “obvious fact of conscious- 
ness, whose contents, nevertheless, either do not 
permit at all of a detailed characterization, or 
are at any rate difficult to characterize" became 
a particular bone of contention. In general, the 
followers of Wundt believed that attitudes 


could be accounted for adequately as feelings, 
particularly as some blend of striving and ex- 
citement. Clarke (1911), a pupil of Titchener, 
found that attitudes in large part are repre- 
sented in consciousness through imagery, sensa-, 
tion, and affection, and that where no such 
states are reported there is presumably merely 
a decay or abbreviation of these same constitu- 
ents. 

However they might disagree upon the nature 
of attitudes in so far as they appear in con- 
sciousness, all investigators, even the most or- 
thodox, came to admit attitudes as an indispens- 
able part of their psychological armamentar- 
ium. Titchener is a case in point. His Outline 
of Psychology in 1899 contained no reference 
to attitude; ten years later, in his Textbook of 
Psychology, several pages are given to the sub- 
ject, and its systematic importance is fully rec- 
ognized: 


Behind everything lies a cortical set, a nervous 
bias, perhaps inherited and permanent, perhaps 
acquired and temporary. This background may 
not appear in consciousness at all; or it may 
appear as a vague, conscious attitude (passive 
imagination), or again as a more or less definite 
plan, aim, ambition, intention (active imagina- 
tion). Whether conscious or not, the nervous 
disposition determines the course of conscious- 
ness (1916, Section 119). 


The meagerness with which attitudes are rep- 
resented in consciousness resulted in a tendency 
to regard them as manifestations of brain activ- 
ity or of the unconscious mind. The persistence 
of attitudes which are totally unconscious was 
demonstrated by Miiller and Pilzecker (1900), 
who called the phenomenon “perseveration.” 
The tendency of the subject to slip into some 
frame of mind peculiar to himself led Koffka 
(1912) to postulate “latent attitudes.” Washburn 
(1916) characterized attitudes as "static move- 
ment systems" within the organs of the body 
and the brain. Other writers, still more physio- 
logically inclined, subsumed attitudes under 
neurological rubrics: traces, neurograms, incito- 
grams, brain-patterns, and the like. 


Psychoanalytic Influence 

The contribution of the Würzburger and of 
all other experimental psychologists was in ef- 
fect the demonstration that the concept of atti- 
tude is indispensable. The discovery that atti- 
tudes are to a large degree unconscious, how- 
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ever, tended to discourage them from a further 
study of the problem. Once a phenomenon has 
been driven, as it were, to take refuge in ner- 
vous tissue, and identified with cortical sets and 
brain fields, the psychologist, at least thé intro- 
spectionist, is disinclined to pursue it further. 
The tendency of experimental orthodoxy is to 
admit the crucial part played by attitudes in all 
mental operations, but to consign them to the 
mysterious limbo of "motivation" and there to 
leave them. 

It was the influence of Freud, of course, that 
resurrected attitudes from this obscurity and en- 
dowed them with vitality, identifying them with 
longing, hatred and love, with passion and prej- 
udice, in short, with the onrushing stream of 
unconscious life. Without the painstaking la- 
bors of the experimentalists attitudes would not 
today be an established concept in the field of 
psychology, but also without the influence of 
psychoanalytic theory they would certainly have 
remained relatively lifeless, and would not have 
been of much assistance to social psychology 
which deals above all else with full-blooded 
phenomena. For the explanation of prejudice, 
loyalty, credulity, patriotism, and the passions 
of the mob, no anemic conception of attitudes 
will suffice. 


Attitudes in Sociology 

For a number of years sociologists have sought 
to supplement their cultural concepts with a 
psychology which might express in concrete 
terms the mechanisms through which culture is 
carried. At first, under the influence of Bagehot, 
Tarde, and Baldwin, a somewhat vaguely pos- 
tulated instinct of imitation (or suggestion) was 
thought adequate. Somewhat later the basis was 
sought in a more varied native equipment of 
men. It is interesting to note that of the first 
two textbooks in social psychology, both pub- 
lished in the year 1908, the one, by Ross, marks 
the demise of the "simple and sovereign" psy- 
chology of imitation-suggestion, and the other, 
by McDougall, marks the commencement of 
the still more vigorous social psychology of in- 
stincts. 

The instinct-hypothesis did not satisfy social 
scientists for long, for the very nature of their 
work forced them to recognize the importance 
of custom and environment in shaping social 
behavior. The instinct-hypothesis has precisely 
the contrary emphasis. What they required was 
a new psychological concept which would es- 
cape on the one hand from the hollow imper- 
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sonality of “custom” and “social force,” and on 
the other from nativism. Being committed to 
some psychological doctrine and dissatisfied 
with instincts they gradually adopted the con- 
cept of attitude. 

The case of Dewey may be taken as fairly 
typical. In 1917 he professed to see in the doc- 
trine of instincts an adequate basis for a social 
psychology. Five years later (1922) he no longer 
found instincts suitable and sought to replace 
them with a concept that would 


+». express that kind of human activity which 
is influenced by prior activity and in that sense 
acquired; which contains within itself a certain 
ordering or systematization of minor elements 
of action; which is projective, dynamic in qual- 
ity, ready for overt manifestation; and which is 
operative in some subdued subordinate form 
even when not obviously dominating activity 
(p. 41). 


To express this complex type of mental organi- 
zation he chose "habit," but admitted as its 
equivalent either "disposition" or "attitude." 
The credit for instituting the concept of atti- 
tude as a permanent and central feature in so- 
ciological writing must be assigned to Thomas 
and Znaniecki (1918), who gave it systematic 
priority in their monumental study of Polish 
peasants. Before this time the term had made 
only sporadic appearances in sociological litera- 
ture, but immediately afterward it was adopted 
with enthusiasm by scores of writers. 
According to Thomas and Znaniecki the 
study of attitudes is par excellence the field of 
social psychology. Attitudes are individual men- 
tal processes which determine both the actual 
and potential responses of each person in the 
social world. Since an attitude is always directed 
toward some object it may be defined as a “state 
of mind of the individual toward a value." Val- 
ues are usually social in nature, that is to say 
they are objects of common regard on the part 
of socialized men. Love of money, desire for 
fame, hatred of foreigners, respect for a scien- 
tific doctrine, are typical attitudes. It follows 
that money, fame, foreigners, and a scientific 
theory are values. A social value is defined as 
“any datum having an empirical content acces- 
sible to the members of some Social group and 
a meaning with regard to which it is or may 
be an object of activity” (p. 21). There are, to 
be sure, numerous attitudes corresponding to 
every social value; there are, for example, many 
views or attitudes regarding the church or the 


state. There are also numerous possible values 
for any single attitude. The iconoclast may di- 
rect his attacks quite at random upon all the 
established social values, or the Philistine may 
accept them all uncritically. To a large extent, of 
course, new social values are created by the atti- 
tudes which are common to many men, but 
these attitudes themselves depend upon pre- 
existing social values. Hence in the social 
world, as studied by the sociologist, both values 
and attitudes must have a place. Primarily it 
falls to the ethnologist and philosopher to ex- 
amine values; but it is social psychology which 
is "precisely the science of attitudes." 

"The authors draw a distinction between atti- 
tudes of temperament and of character; the 
former include what psychologists have been 
accustomed to speak of as instincts and innate 
aptitudes; the latter are the acquired operations 
of the socialized mind—the plans, interests, and 
sympathies which characterize the average citi- 
zen. The authors admit likewise a distinction 
between natural attitudes (toward the physical 
environment), which are of slight interest to so- 
cial psychology, and the social attitudes proper 
which are far more numerous and which consti- 
tute the distinctive subject-matter of the new 
science. 

Following closely in the same vein of thought, 
Faris (1925) proposed additional refinements. 
He would distinguish between conscious and 
unconscious attitudes, between mental and mo- 
tor attitudes, between individual and group at- 
titudes, and between latent and kinetic atti- 
tudes. Park (see Young, 1931), who is likewise 
in essential agreement with this school of 
thought, suggests four criteria for an attitude: 


l. It must have definite orientation in the 
world of objects (or values), and in this respect 
differ from simple and conditioned reflexes. 

2. It must not be an altogether automatic 
and routine type of conduct, but must display 
some tension even when latent. 

3. It varies in intensity, sometimes being 
predominant, sometimes relatively ineffective. 

4. It is rooted in experience, and therefore 
is not simply a social instinct. 


Bernard (1930) has recently prepared a synthesis 
of the conceptions found in current sociological 
writing: 


Social attitudes are individual attitudes directed 
toward social objects. Collective attitudes are 
individual attitudes so strongly interconditioned 


by collective contact that they become highly 
standardized and uniform within the group. . . - 
The attitude is originally a trial response, ie. 
interrupted, preparatory or substitute behavior 
arising within an incompleted adjustment re- 
sponse, but it may become the permanent set 
of the organism. It ranges from concrete mus- 
cular response to that which is abstract, inner 
or neural. . . . Attitudes form the basis of all 
language and communication. In them is im- 
plicit all finished “social behavior and through 
them practically all social adjustment is con- 
summated. . . . Public opinion is the highest 
form of collective attitudes. 


Conclusion 

This brief review of the history of the con- 
cept of attitude has established three important 
facts: 


1. After the breakdown of intellectualistic 
psychology the phenomena of "determination" 
came slowly but certainly to be admitted to un- 
questioned standing in experimental psychol- 
ogy. Attitudes came into fashion. 

9. Under the influence of psychoanalytic the- 
ory the dynamic and unconscious character of 
attitudes became more fully recognized. 

3. In sociological writing there was a gradual 
turning of interest to attitudes considered as the 
concrete representations of culture. 


The effect of these three convergent trends 
within the past fifteen years has been the crea- 
tion of a vigorous doctrine of attitudes, which 
today is bearing most of the descriptive and ex- 
planatory burdens of social psychology. Whether 
the concept is being overworked to such an €x- 
tent that it will be discarded along with the 
past shibboleths of social science remains to be 
seen. It seems more probable that the ever 
increasing number of critical and analytical 
studies will somehow succeed in refining and 
preserving it. 


ATTITUDES AS A FORM OF READINESS 


Let us now consider a representative selection 
of definitions and characterizations of attitude. 


[An attitude is] readiness for attention or action 
of a definite sort (Baldwin, 1901-1905). 
Attitudes are literally mental postures, guides 
for conduct to which each new experience is re- 
ferred before a response is made (Morgan, 1934, 
p. 47). 


ATTITUDES 7 


Attitude = the specific mental disposition to- 
ward an incoming (or arising) experience, 
whereby that experience is modified, or, a con- 
dition of readiness for a certain type of activity 
(Dictionary of Psychology, Warren, 1934). 


An attitude is a complex of feelings, desires, 
fears, convictions, prejudices or other tenden- 
cies that have given a set or readiness to act to 
a person because of varied experiences (Chave, 
1928). 


.a more or less permanently enduring state 
of readiness of mental organization which pre- 
disposes an individual to react in a character- 
istic way to any object or situation with which 
it is related (Cantril, 1934). 


From the point of view of Gestalt psychology a 
change of attitude involves a definite physio- 
logical stress exerted upon a sensory field by 
processes originating in other parts of the ner- 
vous system (Kóhler, 1929, p. 184). 

An attitude is a tendency to act toward or 
against something in the environment which be- 
comes thereby a positive or negative value (Bo- 
gardus, 1931, p. 62). 

By attitude we understand a process of individ- 
ual consciousness which determines real or pos- 
sible activity of the individual counterpart of 
the social value; activity, in whatever form, is 
the bond between them (Thomas and Znaniecki, 
1918, p. 27). 

The attitude, or preparation in advance of the 
actual response, constitutes an important deter- 
minant of the ensuing social behavior. Such 
neural settings, with their accompanying con- 
sciousness, are numerous and significant in so- 
cial life (F. H. Allport, 1924, p. 320). 

An attitude is a mental disposition of the hu- 
man individual to act for or against a definite 
object (Droba, 1933). 

[An attitude] denotes the general set of the or- 
ganism as a whole toward an object or situation 
which calls for adjustment (Lundberg, 1929). 
[Attitudes] are modes of emotional regard for 
objects, and motor “sets” or slight, tentative re- 
actions toward them (Ewer, 1929, p. 136). 

An attitude, roughly, is a residuum of experi- 
ence, by which further activity is conditioned 
and controlled. . .. We may think of attitudes 
as acquired tendencies to act in specific ways 
toward objects (Krueger and Reckless, 1931, 
p. 238). 

When a certain type of experience is constantly 
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repeated, a change of set is brought about 
which affects many central neurons and tends 
to spread over other parts of the central ner- 
vous system. These changes in the general set 
of the central nervous system temper the process 
of reception. . . . In terms of the subjective 
mental life these general sets are called attitudes 
(Warren, 1922, pp. 360 f.). 


An attitude is a disposition to act which is built 
up by the integration of numerous specific re- 
sponses of a similar type, but which exists as 
a general neural "set," and when activated by 
a specific stimulus results in behavior that is 
more obviously a function of the disposition 
than of the activating stimulus, The important 
thing to note about this definition is that it 
considers attitudes as broad, generic (not simple 
and specific) determinants of behavior (G. W. 
Allport, 1929). 

We shall regard attitudes here as verbalized or 
verbalizable tendencies, dispositions, adjust- 
ments toward certain acts. They relate not to 
the past nor even primarily to the present, but 
as a rule, to the future. Sometimes, of course, it 
is a hypothetical future. . . . The "attitude" is 
primarily a way of being "set" toward or against 
things (Murphy and Murphy, 1931, p. 615). 


It is not difficult to trace the common thread 
running through these diverse definitions. In 
one way or another each regards the essential 
feature of attitude as a preparation or readiness 
for response. The attitude is incipient and pre- 
paratory rather than overt and consummatory. 
It is not behavior, but the precondition of be- 
havior. It may exist in all degrees of readiness 
from the most latent, dormant traces of forgot- 
ten habits to the tension or motion which is 
actively determining a course of conduct that 
is under way. 


A Definition of Attitudes 


It is not easy to construct a definition suffi- 
ciently broad to cover the many kinds of atti- 
tudinal determination which psychologists today 
recognize, and at the same time narrow enough 
to exclude those types of determination which 
are not ordinarily referred to as attitudes. The 
definitions considered above contain helpful 
suggestions, and yet none alone is entirely satis- 
factory. The chief weakness of most of them 
seems to be their failure to distinguish between 
attitudes, which are often very general, and 
habits, which are always limited in their scope. 


Any attempt at a definition exaggerates the 
degree of agreement which psychologists have 
reached, but is justified if it contributes toward 
securing greater agreement in the future. The 
following definition has the merit of including 
recognized types of attitudes: the Aufgabe, the 
quasi-need, the Bewusstseinslage, interest and 
subjective value, prejudice, stereotype, and even 
the broadest conception of all, the philosophy 
of life. It excludes those types of readiness which 
are expressly innate, which are bound rigidly 
and invariably to the stimulus, which lack flex- 
ibility, and which lack directionality and refer- 
ence to some external or conceptual object. 4n 
attitude is a mental and neural state of readi- 
ness, organized through experience, exerting a 
directive or dynamic influence upon the indi- 
vidual's response to all objects and situations 
with which it is related. 


POSITIVE AND NEGATIVE ATTITUDES 


An attitude characteristically provokes behav- 
ior that is acquisitive or avertive, favorable or 
unfavorable, affirmative or negative toward the 
object or class of objects with which it is re- 
lated. This double polarity in the direction of 
attitudes is often regarded as their most distinc- 
tive feature, It has a central place in Bogardus' 
definition (1931, p. 52): “An attitude is a ten- 
dency to act toward or against some environ- 
mental factor which becomes thereby a positive 
or negative value." Likewise, Thurstone defines 
an attitude as "the affect for or against a psy- 
chological object" (1932). 

This point of view is a modern version of an 
ancient dialectic. For centuries the opposed cat- 
egories of “attraction” and “repulsion” have in 
one form or another played a decisive part in 
psychological theory. Empedocles assumed as 
the explanation of all activity the two contrast- 
ing immaterial principles of Love and Hate. 
The same opposed forces are prominent in the 
psychological theories of Mantegazza, Brentano, 
and Lindworsky. On a physiological plane one 
again encounters the dialectic of attraction and 
repulsion in the opposition of the flexors and 


. extensors (Sherrington), in facilitation and in- 


hibition (Münsterberg), in resistance and con- 
ductance (Troland), in outreaching and with- 
drawing behavior (Watson), in alliance and 
combat (Tarde), in acquisitive and avertive ten- 
dencies (Kempf), in adient and abient responses 
(Holt), and in pleasure and pain. One recent 
textbook of social psychology bears the subtitle, 


The Psychology of Attraction and Repulsion 
(Smith, 1980), and the same pair of concepts 
underlie the sociological system of Roguin 
(1931, 1982). It is no wonder that many writers 
find it possible to classify all attitudes as either 
positive or negative. It is undoubtedly true that 
the majority fit easily into these categories. 
And yet some attitudes are not readily classi- 
fied. What shall one do, for example, with a de- 
tached, impersonal, or judicial attitude, or with 
an attitude of neutrality? Complacency, amuse- 
ment, tolerance, and openmindedness are not 
easily reduced to “affect for or against” an ob- 
ject. Two bridge-players may have the same 
“degree of affect” toward the game, and yet dif- 
fer qualitatively in their attitudes toward it. 
Two radicals may be equally in favor of change, 
but disagree in the modus operandi of reform. 
Two ‘people equally well disposed toward the 
church may differ in their sacramental, litur- 
gical, esthetic, social, Protestant, or Catholic in- 
terpretation of the church. Is the degree of 
positive or negative affect aroused by the con- 
cept of “God” as significant as the qualitative 
distinctions involved in theistic, deistic, pan- 
theistic, agnostic, intellectualistic, or emotional 
attitudes? When one speaks of attitudes toward 
sex, it is obviously only the qualitative distinc- 
tions that have any intelligible meaning. What 
is a "serene and benevolent mind"? Certainly 
not one devoid of attitudes, nor yet one that is 
a battle-ground of tendencies “for” and tenden- 
cies “against.” All of these objections to the 
unidimensional view argue strongly for the rec- 
ognition of the qualitative nature of attitudes. 
There is, however, one way of meeting these 
objections, namely, by reducing attitudes to 
small enough components. If they are divided 
up into artificialized units, the unidimensional 
conception is saved. The two radicals, for exam- 
ple, who are equally “against” the present social 
system, but who differ in their policies, may 
conceivably be compared in respect to the at- 
traction or repulsion they show for each of the 
disputed policies. The bridge-enthusiasts who 
differ in their attitudes toward the game can 
be compared quantitatively in their attitudes 
for or against conversation during the play. 
Church-goers may be found to vary quantita- 
tively in the degree to which they favor every 
specific practice: baptism by immersion, intinc- 
ture, genuflection, or the use of vestments. And 
even the man who has a neutral attitude may 
be found to have a positive and measurable at- 
tachment to the ideal of neutrality. If such rigid 
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analyses are pursued, all of the complex, qual- 
itative attitudes can be broken down and mea- 
sured in fragments. The price one must pay for 
bi-polarity and quantification in such :cases is, 
of course, extreme, and often absurd, elemen- 
tarism. 


THE MEASUREMENT OF ATTITUDES 


The interest of American social psychologists 
in fact-collecting and statistical methods has re- 
sulted in a rapid advance in the empirical study 
of attitudes, with the result that attitudes today 
are measured more successfully than they are 
defined. As has often been pointed out, the situ- 
ation is not unlike that in the field of intelli- 
gence testing where practicable tests are an 
established fact, although the nature of intelli- 
gence is still in dispute. In recent’ years there 
has been a decline of interest in the measure- 
ment of intelligence (Goodenough, 1934) and 
an increase of interest in the measurement of 
attitudes. It seems as though militant testing, 
having won victories on one field of battle, has 
sought a new world to conquer. The numerous 
methods available for measuring attitudes have 
been often reviewed and do not require restate- 
ment here. The present abbreviated account, 
which confines itself to three methods, may be 
supplemented by the more complete summaries 
of Bain (1930), Droba (1932, 1934a), Katz and 
Allport (1931), Fryer (1931), Murphy and Mur- 
phy (1931), Stagner (see Black, 1933, pp. 115- 
127), Sherman (1932), and Symonds (1931). 


The Census of Opinions 


The simplest method for determining how 
common an attitude (really an opinion) may 
be in a certain population is by counting bal- 
lots or by tabulating answers to a question- 
naire. Roughly, this method may be said to 
"measure" the range and distribution of pub- 
lic opinion, although it does not, of course, 
determine the intensity of the opinion of any 
given individual upon the issue in question. 
The application of this method may be illus- 
trated by reference to a recent poll, widely 


` reported in the newspapers, concerning pacifis- 


tic and militaristic attitudes among 22,627 stu- 
dents in seventy colleges. Thirty-nine per cent 
of these students declared that they would 
participate in no war whatsoever, 33 per cent 
would take part only if the United States 
were invaded, and 28 per cent were ready to 
fight for any cause that might lead the nation 
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to declare war. A critic might remark that 
such a result expresses only “verbal” opinion, 
or at the most merely temporary attitudes, 
which would change under the pressure of 
propaganda. Whatever may be the force of this 
objection, it applies equally to all of the meth- 
ods now existing for determining the strength 
and nature of personal attitudes. 

A far more elaborate census of students’ 
attitudes was made by Katz and Allport (1931). 
In this study 4248 students in Syracuse Uni- 
versity responded to a questionnaire contain- 
ing many hundreds of items. The students did 
not write their opinions, but under each topic 
checked one of several alternative opinions 
with which they felt themselves to be most 
closely in agreement. Obviously this method 
does not provide a true scale of measurement, 
since the alternative items are not scaled in 
respect to their intensity. The results obtained, 
however, can be turned readily into a study 
of the percentages of students who favor each 
of the opinions contained in the questionnaire. 


The a priori Scale 


The so-called a priori scale is essentially a 
test devised on the basis of logical rather than 
empirical considerations. It is an economical 
method, widely used, and easy to apply, but 
in recent years it has been severely criticized. 
There are various forms of the a priori scale, 
but they are all alike in that their scoring is 
arbitrary. Sometimes the author presents a se- 
ries of questions, each of which may have, say, 
five alternative answers from which the subject 
must select one, These alternative answers are 
conceived by the author to lie on a single con- 
tinuum and to be equally spaced from the most 
favorable to the least favorable. To each item 
the author arbitrarily assigns a value of 1, 2, 
3, 4, or 5, according to his opinion of its sig- 
nificance. Another variation allows the subject 
to place in rank order all of the alternatives 
according to his preference; these rank orders 
are then treated as though they were equal 
intervals in the scale. The statistical pitfalls 
of a priori scales have been pointed out by 
Thurstone (1927-1928). 

As an example of a widely used scale of this 
type may be mentioned the test for "social 
distance" devised by Bogardus (1925a, 1925b, 
1927). In this test the subject is asked the 
degree of intimacy he would willingly sanction 
between himself and members of various races. 
The degrees of intimacy listed in one form of 


the test together with their "scale values" are 
as follows: 

1—to close kinship by marriage 

2—to my club as personal chums 

$—to my street as neighbors 

4—to employment in my occupation in my 

country 

5—to citizenship in my country 

6—as visitors only to my country 

7—would exclude from my country 


The weakness of the scale becomes at once 
apparent when it is realized that the distance 
between each of these degrees of intimacy is 
not necessarily comparable. The psychological 
difference between relationship in marriage 
and in a club is likely to be far greater than 
that existing between club relations and neigh- 
borly relations. It becomes therefore mislead- 
ing to assign equally progressing arithmetical 
units to unequal attitudinal differences. An- 
other difficulty arises in the assumption that 
each higher degree of intimacy necessarily im- 
plies all those that are lower; but there are 
cases where admission to neighborly relations, 
for example, is less distasteful than admission 
to one's occupation. 


The Psychophysical (Rational) Scale 


The most significant event in the history of 
the measurement of attitudes was the applica- 
tion of psychophysical methods by Thurstone. 
To apply psychophysical methods it is necessary 
first to conceive of an attitude as a “degree of 
affect” for or against an object or a value with 
which the scale is concerned. If this assumption 
is granted, it becomes possible to study the 
degree of favor or disfavor which each sub- 
ject in a population has toward certain ob- 
jects or values, such as the church, war, 
moving pictures, or government ownership. 
Within the past few years a large number 
of such scales have been devised and made 
available for general use (Thurstone and asso- 
ciates, 1929). 

The scoring values for all of these scales 
are determined by combining the efforts of 
many judges who have arranged all the state- 
ments included in each scale according to their 
discriminable differences. If judges, by and 
large, agree that two statements express about 
the same degree of favor or disfavor it is obvi- 
ously unnecessary to keep both statements in 
the scale; if the statements are widely different 
it is possible by comparing the judges’ sorting 


of each statement in relation to all other state- 
ments to determine its position. The final, ra- 
tional scale results when forty or fifty state- 
ments are secured whose distance from one 
another on a single continuum are known. 
This distance is essentially the discriminable 
difference between the statements as they ap- 
pear to the standardizing group of judges. 
There are various methods by which the dis- 
criminable differences may be determined. The 
commonest is the “method of equal-appearing 
intervals.” The directions for its use involve 
the following steps: 


l. Specify the attitude variable to be mea- 
sured. 

2. Collect a wide variety of opinions relat- 
ing to it, from newspapers, books, or from 
individuals. 

3. Assemble on cards approximately one 
hundred such typical opinions. 

4. Require at least 200-800 judges to sort 
these cards into piles (eleven being a conve- 
nient and commonly employed number), each 
pile representing equidistant degrees of the 
attitude according to each judge's estimation. 

5. Calculate the scale value for each of the 
items by computing the median of the scale 
values assigned to it by the judges, and the 
dispersion of the judgments around the median. 

6. Retain such statements as have small dis- 
persions, and are on the whole equally spaced. 
Give approximately equal representation to 
each of the intervals secured. Clarity and brev- 
ity of wording may furnish additional bases 
for selection. 

7. In applying the scale, the subject checks 
every statement with which he agrees, and his 
score is the mean scale-value for all the state- 
ments he has endorsed. 


The most useful procedure in constructing 
such scales is to follow the models offered 
in the Thurstone-Chave (1929) or Peterson- 
Thurstone (1933) scale. Directions for uniform 
wording have been suggested by Droba (1932), 
Wang (1932), Kulp (1933), and Stagner (see 
Black, 1933, pp. 115-127). Further details con- 
cerning the construction and use of psycho- 
physical scales may easily be traced through 
the literature (Black, 1933; Dockeray, 1932; 
Thurstone, 1927-1928, 1929, 1932; Remmers, 
1934). 

As revolutionary as the rational scale un- 
doubtedly is, certain criticisms must be made 
against the method as it is at present employed: 


ATTITUDES n 


l. As has already been indicated above, at- 
titudes are not necessarily arranged naturally 
upon a single continuum; they are often dis- 
crete and highly individual (cf. Katz and All- 
port, 1931). 

2. There is also the question whether scale 
values for statements derived from one popu- 
lation of judges is applicable to other popu- 
lations of subjects (Rice, 1930). For example, 
can the judgments of adults concerning the 
significance of a statement dealing with moving 
pictures be incorporated in a test that is to 
be administered to children? 

3. Likert (1932-1933) has shown that the 
simple a priori method of scoring in arbitrary 
units (1 to 5) when applied to these rational 
scales may yield results as reliable as do the 
psychophysical scores themselves. The agree- 
ment between the two methods is approxi- 
mately .90. This fact may give comfort to inves- 
tigators who wish to avoid the more complex 
procedures. 


Thurstone’s strictures upon the logic of a 
priori scales are undoubtedly sound, but they 
do not necessarily invalidate these scales when 
only practical results are desired. Suppose, for 
instance, that a psychologically minded chair- 
man wishes to determine at a certain meeting 
the temper of his audience in reference to 
some issue under discussion. He can quickly 
prepare and quickly (if roughly) score an 
a priori scale; whereas the preparation of a 
more carefully standardized test would be im- 
practicable and unnecessarily fine-grained for 
his purposes. 


Conclusions 


The success achieved in the past ten years in 
the field of the measurement of attitudes may 
be regarded as one of the major accomplish- 
ments of social psychology in America. The 
rate of progress is so great that further achieve- 
ments in the near future are inevitable. But 
there are inherent limitations in all methods 
of testing. Unless these are kept in mind the 
zeal for measurement may overstep reasonable 
bounds. 


1. Measurement can deal only with attitudes 
that are common, and there are relatively few 
attitudes that are common enough to be profit- 
ably scaled. In forcing attitudes into a scale 
form violence is necessarily done to the unique 
structure of man’s mind. Attitude scales should 
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be regarded only as the roughest approxima- 
tions of the way in which attitudes actually 
exist in the mental life of individuals. 

2. Each person possesses many contradictory 
attitudes, and for this reason his mental set at 
the moment of submitting to a scale may tell 
only a part of the story. Furthermore, attitudes 
often change, and an investigation made under 
one set of conditions may not for long present 
a true picture of the attitudes of any given 
group. Stagner (see Black, 1933) reports a meet- 
ing of farmers in a village in northern Wis- 
consin who, under the influence of a persuasive 
speaker, voted unanimously one afternoon to 
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2. The Measurement of Social Attitudes 


L. L. THURSTONE 


It is an honor and a privilege for me to have 
this opportunity of addressing the Midwestern 
Psychological Association. I wish that I could 
do justice to the occasion and express my ap- 
preciation by an address that is worth listening 
to. I have selected among the few subjects 
that are available to me one that may be of 
fairly general interest while it still involves 
many theoretical and psychological problems. 
I shall discuss the measurement of social atti- 
tudes. In doing so I shall review the develop- 
ment of the measurement methods that are 
applicable to attitudes and I shall also discuss 
some of the criticisms and questions that have 
recently been raised about this subject. 
Several years ago when I was teaching con- 
ventional psychophysics, it seemed to me that 
psychophysics was really a very dull subject 
in spite of the fact that it did offer the satis- 
faction of clean and quantitative logic. This 
type of satisfaction is rare in psychological in- 
vestigation and consequently psychophysics has 
stood out as a very dignified topic in psychol- 
ogy in spite of the fact that its intrinsic subject 
matter has been, on the whole, rather trivial. 
These depreciative statements about psycho- 
physics can be readily amplified by referring 
to the conventional publications in this sub- 
ject. You will then find that one of the elabo- 
rate parts of the subject is the determination 
of limens. There is a great deal of hairsplitting 
about just how a limen should be determined 
with the greatest possible precision. In deter- 
mining a limen you fit a phi-gamma curve and 
then there is more hairsplitting as to whether 
you should adjust the errors of observation in 
the proportions or in the stimulus magnitudes. 


* Reprinted from Journal of Abnormal and Social 
Psychology, 1931, 26, 249-269, with permission of 
the American Psychological Association. This arti- 
cle was originally presented as an address for the 
Midwestern Psychological Association, May 9, 1931. 
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Then you will find the several psychophysical 
methods compared as to which gives the most 
reliable limen determination. And then you 
can find short-cuts for these methods by which 
you can determine somebody's limen very 
quickly when you are in a hurry for a limen. 
Now, it seems strange that I have never seen 
a psychologist who really cared much about 
any particular person’s limen for anything! 

I venture the guess that not more than per- 
haps half a dozen psychologists in this room 
have ever needed or wanted somebody's limen 
for anything with a high degree of precision. 
And I don't believe that the proportion would 
be much higher in a group of Eastern psychol- 
ogists. 

Of course we are interested to know the 
order of magnitude of errors in visual dis- 
crimination as compared with those of various 
forms of auditory discrimination. Even in a 
rough pitch discrimination test we are deter- 
mining a limen in a sense, but these problems 
never involve any profundities of curve fitting 
for the limen of an individual subject. Why 
then does psychophysics bother so much about 
methods, and shortcuts for these methods, 
which are never used on individual subjects 
except when an individual serves as a speci- 
men for some type of situation in which the 
methods must be adapted to the conditions of 
each problem anyway? This bothered me also 
in teaching the subject and I came to share 
the distrust of my students in the significance 
of the whole subject. 

One way in-which to retain the satisfactions 
that can be found in the logic of this subject 
is to change its content. We have tried this 
and it has seemed to some of us that psycho- 
physics thereby takes on an entirely new aspect. 
Instead of asking a person, "Which of these 
two little cylinders is the heavier?" (apologies 
to Mr. Boring for the stimulus error)—instead 
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of asking a person which of two cylinders is 
the heavier, we might as well ask him some- 
thing interesting, such as, “Which of these two 
nationalities do you in general prefer to asso- 
ciate with?” or, “Which of these two offenses 
do you consider to be in general the more 
serious?” or, “Which of these two pictures or 
colored designs do you like better?” Questions 
of this sort of discrimination might be multi- 
plied indefinitely and if they could be handled 
with some sort of psychophysical logic it is 
clear that we should have here the possibilities 
of objective description of more psychological 
significance than the sensory limen. 

The first objection that I encountered is that 
the very term psychophysics should be strictly 
limited to the field of sensory discrimination 
and that the questions that I prefer to ask 
should not be regarded as of the dignity of 
psychophysics—they should be relegated to the 
field of mental tests and questionnaires. I find 
justification for extending the use of psycho- 
physics even to questions that are interesting 
by referring to Titchener. He says: “Fechner 
was chiefly interested in the intensive aspect 
of mental processes, and among mental proc- 
esses in sensation. His example has led other 
inquirers to give a disproportionate amount of 
attention to the laws of sensation intensity.” 
It is unfortunate that Mr. Titchener did not 
happen to ask his students to judge the rela- 
tive merits of handwriting specimens or of 
English compositions or to make social judg- 
ments because then I am sure that we should 
now have the permission to use psychophysical 
methods in a wide variety of problems. 

Furthermore, if, instead of printing our 
questions on a piece of paper we should rig 
up an elaborate automatic contraption for ex- 
posing these questions, running the said con- 
traption by an electric motor, and spreading 
plenty of kymographs and telegraph keys and 
speech keys and time markers all around the 
room, then I am sure that our studies would 
qualify as experimental psychology. But since 
we ask the subject to indicate his response 
with a pencil instead of by a telegraph key, 
our investigations have been outside the pale 
of experimental psychology. e 

When the constant method is used in its 
complete form so that every stimulus serves in 


turn as a standard, then it becomes the method 


of paired comparison. There was no quanti- 
tative logic for handling the method of paired 
comparison so as to obtain measurement which 


satisfied the criterion of internal consistency. 
This difficulty was overcome by finding an 
equation that satisfied this criterion. It has 
been referred to as the law of comparative 
judgment. With this rational equation and the 
method of paired comparison we have made 
several studies involving social stimuli and in 
which the subjects were asked to express vari- 
ous kinds of judgments other than mere com- 
parison of physical magnitudes. 

In one of these experiments a list of twenty 
nationalities was presented to several hundred 
students (1). The nationalities were arranged 
in pairs so that every nationality was paired 
with every other one in the list. The students 
were asked to underline one nationality of each 
pair to indicate which of the two nationalities 
they would rather associate with. The returns 
were tabulated in the form “proportion of the 
subjects who prefer nationality A to nationality 
B.” With these experimental proportions and 
by the law of comparative judgment, the scale 
separation was calculated for each pair. By 
means of these data it was possible to construct 
a linear scale of attitude to which each na- 
tionality was allocated. At the top of the list 
is the American, next come other English 
speaking countries, and at the bottom of the 
list are nationalities or races other than our 
own. This order is what one should expect, 
but the scale values could not be predicted. 

In general the scaling is accomplished on 
the principle that if the group of subjects very 
generally prefers A to B, then the proportion 
of the subjects who vote for A will be high, 
perhaps close to unity (2). If, on the other 
hand, the two nationalities are about equally 
well liked by the group, then there will be 
about as many subjects who vote for A as there 
are subjects who vote for B. Hence, the pro- 
portion above described will be close to .50 
and the two nationalities will have zero sepa- 
ration, that is, they will have the same scale 
value. 

In an experiment of this sort the criterion 
of internal consistency consists in the discovery 
that by assigning one scale value to each of 
the nationalities we can reconstruct all of the 
experimentally independent proportions. With 
twenty nationalities in the list we have twenty 
scale values and these must be sufficient to lock 
the 190 experimentally independent propor- 
tions within the known probable errors of the 
given proportions. 

One of the criticisms of this procedure has 
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been that the entries in the list are not true 
nationalities. For example, in the list occur 
such entries as Jew, Negro, South American. 
It is not necessary to restrict ourselves to ac- 
cepted anthropological classifications in these 
experiments. We are measuring the degree of 
affect for or against the social objects listed. 
This is legitimate even if some of the classifi- 
cations are races rather than nationalities, or 
religions, or groups of nationalities. For exam- 
ple, it is conceivable that some of the students 
disliked South Americans in general without 
knowing much about them and without stop- 
ping to debate whether these South Americans 
whom they disliked were one or twenty na- 
tionalities. The category is a conversational one 
which lends itself to the expression of affect 
and it therefore serves our purposes even if 
the psychological or affective category does not 
fit the accepted anthropological or political 
classifications. There is of course nothing to 
prevent the use of other classifications so long 
as the categories lend themselves to the expres- 
sion of the likes and dislikes of people gener- 
ally. 

Another criticism that has been offered 
against experiments of this type is that it would 
make a difference if the question were worded 
differently. For example, in the form here de- 
scribed the question was “Which of these two 
nationalities or races would you rather associate 
with?” Now, so runs the objection, what would 
happen if the question were “Which would 
you rather have as a fellow student?", “Which 
would you rather have for a neighbor?', 
"Which would you rather have your sister 
marry?" and "Which would you rather do busi- 
ness with?" (3). Fortunately, this question could 
be answered by an experiment. Fifteen hundred 
blanks were used in which there were three 
hundred blanks with each of the five questions. 
"These were arranged in random order so that 
it was a matter of chance which of the five 
forms was given to each of the fifteen hundred 
subjects in the experiment. When the blanks 
had been filled in, they were sorted out into 
five piles according to the question on the 
blank. The twenty nationalities were then allo- 
cated to a scale separately for each of the five 
questions. The twenty nationalities were found 
to be in the same rank order in the scales 
which were constructed on the basis of the five 
questions. This proves that the scale value of 
a nationality is determined primarily not by 
the detailed form of the question but rather by 


the general degree of like or dislike of the sub- 
jects for each nationality. This general like or 
dislike is what we have called the potential 


„action toward the object or attitude. 


On the other hand, if the question had been 
“Which nationality would you rather have as 
a servant?" it is quite conceivable that the 
scale values might have been different, because 
such a question is not calculated to bring out 


. the attitude of the subject toward the national- 


ity with regard to social equality. If the ques- 
tion asked which of the nationalities is the 
more intelligent, which is physically superior, 
which is more emotional, which is the taller, 
it is quite certain that the scale values would 
have been different. 

The generalization of the last experiment 
might be made by noting that a subject could 
be half way through the blank, responding by 
underlining one of each pair, and stil not 
remember what the question was that he was 
answering by his underlining. This means 
merely that the subject gets a set of checking 
preferences without recalling the cognitive de- 
tail of the question. The question has then 
served its usefulness by giving the subject a 
set of expressing his attitudes rather than some 
intellectual judgment about the nationalities 
in question. The scale value of each nationality 
measures the affective value of the nationality 
for the group of subjects. 

It is obvious that the scale value of each 
nationality in these experiments is a descrip- 
tion of the group of subjects as much as it is 
a description of the nationalities. If the same 
experiment were repeated with Italian students 
or with Russian students, the scale values of 
the nationalities would undoubtedly be radi- 
cally different. 

This suggests the possibility of measuring 
cultural similarities or dissimilarities. Suppose 
that these paired comparison schedules were 
filled in by university students in ten different 
countries. The scale value of each of the 
twenty nationalities would be calculated sepa- 
rately for each of the ten countries. We could 
then calculate the correlation coefficient for the 
scale values of the twenty nationalities in two 
countries, for example, the German and the 
French students. If the correlation coefficient 
were high it would indicate a similarity in the 
nationality preferences of the two groups of 
students. If the correlation were low it would 
measure dissimilarity of the two groups of stu- 
dents as to their national preferences. If coun- 
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try C is very much hated by A and liked by B, 
then that difference would be measurable by 
the difference in the scale value for nationality 
C in the scales for groups A and B. In this 
way, international affiliations and antipathies 
might be described in a quantitative manner. 
Of course differences might appear in different 
occupational groups and in different regions 
of the same country. 

Suppose that one group of subjects likes all 
of the twenty nationalities about equally well. 
Then the twenty nationalities would have the 
same scale value. Suppose that another group 
of subjects has very decided likes and dislikes 
among the twenty nationalities. Some of the 
nationalities would then have high scale values 
while others would have conspicuously low 
scale values. In other words, the spread in scale 
values would be much greater for the second 
group than for the first group. The intoler- 
ance of a group is measured by the spread in 
scale values. In this manner it would be pos- 
sible to measure the tolerance of different coun- 
tries for other countries. These measurements 
of international tolerance might conceivably 
have considerable social interest. 

Some of our experiments have been set up 
so as to measure the effect of social stimuli on 
the international attitudes of high school chil- 
dren. We have worked with a number of mo- 
tion picture films as stimuli (4). In Genoa, Illi- 
nois, the film Four Sons was shown in the local 
theater and 131 children in grades 7 to 12 in- 
clusive were given free tickets. Several days 
before the performance they were asked to fill 
in a paired comparison schedule of nationality 
preferences. The Germans were included in 
this list. The morning after the performance 
the children again filled in the same schedule. 
It was assumed that the attitudes of the chil- 
dren did not change toward the twenty na- 
tionalities during the course of a week or ten 
days between the two schedules except for the 
possible effect of the film on their attitude 
toward Germans. The film made the children 
much more friendly toward Germans. This 
experiment and others of the same general 
type demonstrate that the effect of a single 
social stimulus on the international attitudes 
of the subjects can be measured. 

The same general technique has been used 
for measuring attitude toward crimes (5). In 
such experiments a list of crimes was arranged 
in the same paired forms and the subjects 
were asked to indicate for each pair which of 


the two crimes they considered to be the more 
serious. In this manner it was found in one 
town that the attitudes of children toward gam- 
bling were considerably affected by seeing the 
picture Street of Chance. The film had the 
effect of making the children regard gambling 
as a more serious offense after seeing the pic- 
ture. These experiments with motion pictures 
have been effectively conducted by Miss Ruth 
Peterson. 

The question has been raised as to the de- 
gree of permanence of these effects. In order 
to answer this question we have repeated the 
schedules of comparison in several towns after 
an interval of four to five months. The atti- 
tudes have returned about half way toward 
their original values in four months, but these 
effects vary of course with the film used and 
the frequency of other social stimuli. In one 
town the effect lasted without diminution for 
five months. 

When a series of social stimuli has been allo- 
cated to an affective continuum by the method 
of paired comparison we have the scale sepa- 
rations but we do not have a rational origin 
for the affective continuum. This is a problem 
for which Mr. Horst has found a very inge- 
nious solution. It is of some psychological inter- 
est to locate a series of stimuli to an affective 
continuum in such a manner that the measure- 
ments refer to a datum of affective neutrality. 
The solution by Mr. Horst consists in asking 
a group of subjects to compare one stimulus A 
which is likely to be favorably regarded with 
another stimulus B which is likely to be con- 
sidered unfavorable. The subject is asked this 
question, “Would you be willing to endure 
the disadvantage B in order to have the advan- 
tage A?" The proportion of subjects who are 
willing to accept B in order to have A allocates 
the affective origin between the two stimuli. 
The same procedure can of course be extended 
to a whole series of stimuli so that the location 
of an affective datum in the series can be tested 
by the criterion of internal consistency. 

Another method of measuring attitude is to 
use a statement scale (6). This consists in a 
series of opinions which are submitted to the 
subject for endorsement or rejection. These 
statements or opinions have been so selected 
that they constitute an evenly graduated series 
and so that a scale value can be given to each 
opinion. If the opinions A, B, C, D are four 
successive opinions in such an evenly graduated 
scale about prohibition, for example, then the 
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following conditions would be satisfied. If one 
person endorses opinion A and another person 
endorses opinion B, then a group of observers 
should find some difficulty in saying which of 
the two opinions is more favorable to prohi- 
bition. Let us suppose that three-fourths of the 
observers would say that opinion A is more 
favorable to prohibition than opinion B. Then 
this degree of difficulty in judging which of 
them is the more favorable to prohibition 
constitutes a measure of the separation between 
the two statements of opinion on the attitude 
scale. Now, if opinion C is so chosen that 
three-fourths of the observers say that B is 
more favorable to prohibition than C, then 
the scale separation between A and B is the 
same as the separation between B and C. In 
this manner a series of statements is selected 
from a large number so that the apparent incre- 
ment in attitude from one statement to the 
next is the same for the whole series. 

With a scale value assigned to each statement 
or opinion, it is of course easy to calculate 
the median scale value of all the statements 
that any given individual has endorsed. This 
median scale value is the score of that person 
on the attitude scale. The meaning of these 
scores can be illustrated further as follows. 
Suppose that three individuals, X, Y, Z, have 
attitude scores on prohibition which are equally 
spaced. Then the difference or increment in 
attitude between X and Y would seem to be 
the same as the difference in attitude between 
Y and Z. In other words, it would be just as 
difficult to discriminate between X and Y as to 
which is the more favorable to prohibition as 
it is difficult to discriminate between Y and Z. 
This is the basis for the construction of the 
attitude scales. The psychophysical experimen- 
tal methods by which the attitude scales are 
constructed so as to satisfy these requirements 
are beyond the scope of this paper. 

The statement scale enables one to make 
several types of measurement of which the fol- 
lowing are examples. 

The attitude of an individual subject can 
be measured by means of a statement scale. 
The paired comparison procedure enables us 
to compare groups of subjects but the state- 
ment scale procedure is preferable for meas- 
uring the attitudes of individual subjects. The 
range of statements that the individual en- 
dorses gives some indication of his tolerance. 
It is possible to plot a frequency distribution 
of the attitudes of a group of people toward 


labor unions, for example. This distribution 
has a central tendency or average and it has 
a measurable dispersion. Two groups of peo- 
ple may then be found to have the same aver- 
age score on a disputed issue but one of the 
two groups may be more heterogeneous than 
the other. The degree of heterogeneity in atti- 
tude of a group of people is directly measured 
by the standard deviation of the frequency dis- 
tribution of their attitude scores. This is an 
important aspect of group comparisons which 
can be reduced easily to measurement in terms 
of the dispersion of the scores. 

In two small towns, West Chicago and 
Geneva, Illinois, an experiment was arranged 
so that a film favorable to the Chinese was 
shown in one town and a film unfavorable to 
the Chinese was shown in the other town (7). 
The two films were Welcome Danger which is 
thought to be unfriendly to the Chinese and 
which has been so criticized by the Chinese 
themselves, and Son of the Gods which is gen- 
erally thought to be friendly in its interpreta- 
tion of Chinese culture. The films were shown 
in the local theaters and the children were 
given free tickets to the performances. In each 
town the children were asked to fill in a state- 
ment scale about the Chinese several days be- 
fore seeing the film and also the morning after 
seeing it. The results show a very decided shift 
in favor of the Chinese in Geneva where Son 
of the Gods was shown. In West Chicago there 
was a small opposite effect where the children 
saw the film, Welcome Danger. The effect of a 
single social stimulus, such as a motion picture 
film, on the international attitudes of school 
children can be described by the statement 
scale as well as by the paired comparison 
method. 

The statement scale is constructed by asking 
a group of one hundred judges to sort out à 
list of opinions into a series of eleven succes- 
sive piles to represent attitudes from one ex- 
treme to the other. The question was raised 
early in our experiments whether the attitudes 
of the judges themselves would influence the 
final scale values of the statements in the scale. 
For this reason Mr. Hinckley set up an experi- 
ment for measuring attitude toward the Negro. 
He had 114 statements about the Negro. At one 
extreme were the opinions that the Negro is 
the equal of the white man and should have 
equal social privileges. At the other extreme 
were the opinions that the Negro is inferior to 
the white man and should not have the same 
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social privileges. Three groups of judges were 
used, namely, one group of white college stu- 
dents friendly toward the Negro, one group of 
white college students who thought the Negro 
was definitely inferior, and one group of edu- 
cated Negroes. The whole list of 114 statements 
was scaled separately for the three groups of 
judges. The result was that the three scales so 
constructed were practically identical, thus 
proving that the attitudes of the judges have 
no serious effect on the measuring function of 
the statement scale. 

It is possible to apply the statement scale 
method to the measurement of social trend. 
This will be illustrated in terms of attitude 
toward the Germans and the French. A col- 
lection of quotations from newspaper editorials 
has been made by Mr. Russell. His quotations 
cover the twenty year period 1910-1930. A 
group of judges sorted the editorial quotations 
into a series of eleven piles ranging from No. 1 
expressing extreme admiration for the Germans 
to No. 11 expressing extreme contempt for them. 
The scale value of each quotation was calculated. 
Then the average scale value of all the quota- 
tions from the year 1910 was determined. It is 
the mean attitude toward the Germans for that 
year. The average scale value of all the quota- 
tions from year 1911 was determined and it is 
the mean attitude toward the Germans for that 
year. In this manner the mean attitude toward 
the Germans was represented quantitatively so 
that this social trend for a period of 20 years 
could be inspected in a single graph. 

For this particular issue the curve shows the 
expected depression in the average scale value 
for the Germans during the years of the war 
and a corresponding rise in average scale value 
for the French during the same period. Of 
course the curves so plotted represent only the 
attitudes of the editorials of one large news- 
paper. Mr. Russell is now making this type of 
inquiry for four newspapers, namely the New 
York Times, the World, the Chicago Daily 
News, and also the Chicago Tribune. The 
curves for these four newspapers are similar. 
They all show a return of attitudes toward the 
pre-war values with interesting deviations that 
correspond to popularly discussed issues at 
various times. The newspapers also show some 
differences in the rapidity with which their 
editorials return toward pre-war attitudes for 
Germany. We have here the possibility of mea- 
suring the changes in attitudes as represented 
in the press during past times even though the 


attitude scales were not available for these 
periods. It will be interesting to study by these 
quantitative methods the rapidity with which 
international attitudes have changed before 
and after each of the recent modern wars by 
analyzing the foreign press for a few years be- 
fore and after each war. It will also be of in- 
terest to correlate these rates of change with 
other social facts such as facility of communi- 
cation, similarity of language and culture, and 
the like. Perhaps this will be a psychophysical 
contribution to the methods of history. 

Many of the criticisms and questions that 
have appeared about attitude measurement 
concern the nature of the fundamental con- 
cepts involved and the logic by which the 
measurements are made. I shall consider a few 
of these questions briefly. 

One of the most frequent questions is that 
a score on an attitude scale, let us say the scale 
of attitude toward God, does not truly describe 
the person’s attitude. There are so many com- 
plex factors involved in a person's attitude on 
any social issue that it cannot be adequately 
described by a simple number such as a score 
on some sort of test or scale. This is quite true, 
but it is also equally true of all measurement. 

The measurement of any object or entity 
describes only one attribute of the object mea- 
sured. This is a universal characteristic of all 
measurement. When the height of a table is 
measured, the whole table has not been de- 
scribed but only that attribute which was 
measured. Similarly, in the measurement of 
attitudes, only one characteristic of the attitude 
is described by a measurement of it. 

Further, only those characteristics of an ob- 
ject can be measured which can be described 
in terms of “more” or “less.” Examples of such 
description are: one object is longer than an- 
other, one object is hotter than another, one 
is heavier than another, one person is more 
intelligent than another, more educated than 
another, more strongly favorable to prohibi- 
tion, more religious, more strongly favorable 
to birth control than another person. These 
are all traits by which two objects or two per- 
sons may be compared in terms of "more" or 
“less.” 

Only those characteristics can be described 
by measurement which can be thought of as 
linear magnitudes. In this context, linear mag- 
nitudes are weight, length, volume, tempera- 
ture, amount of education, intelligence, and 
strength of feeling favorable to an object. 


20 HISTORICAL FOUNDATIONS 


Another way of saying the same thing is to 
note that the measurement of an object is, in 
effect, to allocate the object to a point on an 
abstract continuum. If the continuum is weight, 
then individuals may be allocated to an ab- 
stract continuum of weight, one direction of 
which represents small weight while the oppo- 
site direction represents large weight. Each 
person might be allocated to a point on this 
continuum with any suitable scale which re- 
quires some point at which counting begins, 
called the origin, and some unit of measure- 
ment in terms of which the counting is done. 

The linear continuum which is implied in 
all measurement is always an abstraction. For 
example, when several people are described as 
to their weight, each person is in effect allo- 
cated to a point on an abstract continuum of 
weight. All measurement implies the reduction 
or restatement of the attribute measured to an 
abstract linear form. There is a popular fallacy 
that a unit of measurement is a thing—such 
as a piece of yardstick. This is not so. A unit 
of measurement is always a process of some 
kind which can be repeated without modifi- 
cation in the different parts of the measure- 
ment continuum. 

Not all of the characteristics which are con- 
versationally described in terms of “more” or 
“less” can actually be measured. But any char- 
acteristic which lends itself to such description 
has the possibility of being reduced to measure- 
ment. 

We admit that an attitude is a complex af- 
fair which cannot be wholly described by any 
single numerical index. For the problem of 
measurement this statement is analogous to 
the observation that an ordinary table is a 
complex affair which cannot be wholly de- 
scribed by any single numerical index. So is a 
man such a complexity which cannot be wholly 
represented by a single index. Nevertheless we 
do not hesitate to say that we measure the 
table. The context usually implies what it is 
about the table that we propose to measure. 
We say without hesitation that we measure a 
man when we take some anthropometric mea- 
surement of him. The context may well imply 
without explicit declaration what aspect of the 
man we are measuring, his cephalic index, his 
height or weight or blood pressure or what not. 
Just in the same sense we shall say here that 
we are measuring attitudes. We shall state or 
imply by the context of people's attitudes that 
we are measuring. The point is that it is just 


as legitimate to say that we are measuring atti- 
tudes as it is to say that we are measuring 
tables or men. 

Whenever a common word is adopted for 
scientific use, it nearly always suffers some re- 
striction in its connotation in favor of greater 
precision of meaning. This has happened in 
many sciences, so that it is by no means pecu- 
liar to psychological terms. Consider, for exam- 
ple, such words as elasticity, momentum, force, 
which are after all common ordinary words, 
but as they are used by the physicist they are 
very much restricted and more precise while 
still retaining the essential ordinary idea. So 
it is in psychology with terms like sensation, 
perception, illusion, meaning, idea, and con- 
cept. Now when we turn scientific logic and 
experimental psychophysical procedures to the 
subject of attitudes, we find it necessary to re- 
strict here also the rather loose conversational 
meaning of this term in order to make it at 
all suitable for scientific discourse. 

Our present definition of the term may be 
briefly stated as follows: Attitude is ‘the affect 
for or against a psychological object. Affect in 
its primitive form is described as appetition 
or aversion. Appetition is the positive form of 
affect, which in more sophisticated situations 
appears as liking the psychological object, de- 
fending it, favoring it in various ways. Aversion 
is the negative form of affect, which is decribed 
as hating the psychological object, disliking it, 
destroying it, or otherwise reacting against it. 
Attitude is here used to describe potential ac- 
tion toward the object with regard only to the 
question whether the potential action will be 
favorable or unfavorable toward the object. 
For example, if we say that a man’s attitude 
toward prohibition is negative, we mean that 
his potential actions about prohibition may be 
expected to be against it, barring compromises 
in particular cases. When we say that a man’s 
attitude toward prohibition is negative, we 
have merely indicated the affective direction 
of his potential action toward the object. We 
have not said anything about the particular 
detailed manner in which he might act. In this 
sense the term attitude is an abstraction in 
that it cannot be described without inserting 
the cognitive details that are irrelevant, but 
this is also true of many of the simplest con- 
cepts in daily use. 

The affect about an object may be of strong 
intensity, or it may be weak. The positive and 
negative affect therefore constitutes a linear 
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continuum with a neutral point or zone and 
two opposite directions, one positive and the 
other negative. Measurement along this affec- 
tive continuum is of a discriminatory character 
with the discriminal error as a unit of measure- 
ment. 

Against this restricted definition of attitude 
as the affective character of potential action 
about a psychological object there have been 
raised several questions. It has been pointed 
out that the emotional experiences of the past 
constitute an integral part of a man’s attitude. 
If we should use the term in that inclusive 
manner, we should say that a man’s attitude 
toward religion consists in part of his child- 
hood experience with Sunday schools. The man 
might then say that his attitude toward religion 
is that he went to church when he was a child. 
I should prefer to say that such a fact is really 
not a part of his attitude toward religion but 
that it may help to explain how he got that 
way. It is quite conceivable that two men may 
have the same degree or intensity of affect 
favorable toward a psychological object and 
that their attitudes would be described in this 
sense as identical but that they arrived at 
their similar attitudes by entirely different 
routes. It is even possible that their factual 
associations about the psychological object 
might be entirely different and that their overt 
actions would take quite different forms which 
have one thing in common, namely, that they 
are about equally favorable toward the object. 

In these discussions the term psychological 
object has its customary meaning. It may refer 
to a physical object, or it may refer to an idea, 
a plan of action, a form of conduct, an ideal, 
a moral principle, a slogan, or a symbol. In 
fact, it may refer to any idea about which the 
subject may express positive or negative affect. 

There comes to mind the uncertainty of us- 
ing an opinion as an index of attitude. The 
man may be a liar. If he is not intentionally 
misrepresenting his real attitude on a disputed 
question, he may nevertheless modify the ex- 
pression of it for reasons of courtesy, especially 
in those situations in which frank expression 
of attitude may not be well received. This has 
led to the suggestion that a man's action is a 
safer index of his attitude than what he says. 
But his actions may also be distortions of his 
attitude. A politician extends friendship and 
hospitality in overt action toward a Negro 


while hiding an attitude that he expresses 


more truthfully to an intimate friend. Neith 
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his opinions nor his overt act constitutes in 
any sense an infallible guide to the subjective 
inclinations and preferences that constitute his 
attitude. Therefore we must remain content to 
use opinions or other forms of action merely 
as indexes of attitude. It must be recognized 
that there is a discrepancy, some error of mea- 
surement as it were, between the opinion or 
overt action that we use as an index and the 
attitude that we infer from such an index. 
But this discrepancy between the index and 
"truth" is universal. When you want to know 
the temperature of your room, you look at the 
thermometer and use its reading as an index 
of temperature just as though there were a 
single temperature reading which is the "cor- 
rect" one for the room. If it is desired to ascer- 
tain the volume of a glass paperweight, the 
volume is postulated as an attribute of the 
piece of glass, even though volume is an ab- 
straction. The volume is measured indirectly „y 
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by noting the dimensions of the glass or by O 
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immersing it in water to see how much water tag 


it displaces. These two procedures give two 4 


indexes which may not agree exactly. In every y9 
situation involving measurement there is pos- WL 


tulated an abstract continuum such as volume 
or temperature, and the allocation of the thing $ 
measured to that continuum is accomplished $ 
by indirect means through one or more indexes. 
Truth consists only in the relative consistency 
of the several indexes, since it is never directly 
known. We are dealing with the same type of 
situation in attempting to measure attitude. 
We postulate an attitude variable which is like 
practically all other measurable attributes in 
the nature of an abstract continuum, and we 
must find one or more indexes which will sat- 
isfy us to the extent that they are internally 
consistent. Only to the extent that different in- 
dexes are consistent can we be justified in pos- 
tulating the attitude as a trait, for a trait is 
never directly measured. 

If we should find that what a man says has 
absolutely no relation to what he does, then 
such inconsistency would constitute a serious 
limitation on the legitimacy of the abstraction 
of attitude. However, when we actually carry 
out such comparisons, we do find that the cor- 
relation is positive between verbally expressed 
attitudes and overt action. The correlation is 
not perfect, but it is certainly positive. Dis- 
crepancies will arise, as when a subject ex- 
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presses himself as favorable to prohibition 
p al 


jough he himself violates it. But if the cor- 
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relation is tabulated for a large group of 
people, it is found that the attitude scores for 
those who vote for prohibition are markedly 
different from those who vote against it, and 
a similar positive correlation is found between 
attitude scores and overt action about drinking. 
The reason for limiting ourselves to verbal 
expressions of attitude is that they can be eval- 
uated with more certainty, and they are much 
more available than a list of overt acts. This 
type of correlation between attitude and overt 
action was the subject of a recent study by 
Mr. Stouffer. 

In order to deal with overt actions as expres- 
sive of attitudes in a feasible manner Mr. 
Rosander has prepared lists of situations with 
alternative overt acts. He asks the question, 
"In the following situations, which of the given 
alternatives are you most likely to do?" Then 
follows description of a situation with a num- 
ber of alternative overt acts. For example, in 
a situation scale on the Negro occurs the fol- 
lowing: 


The congregation of the church you attend 
has always been white. One Sunday morning 
a Negro attends the services. 


You do nothing about it. 

You complain to the minister. 

You welcome the Negro to the church. 

- You shake hands with him and ask him 
to come again. 

€. You sound out opinion to find how many 
want to keep out Negroes, 

f. You tell the fellow he had better move 
along. 

E. You ask the minister to tell him that he 
is not wanted. 

h. You tell the fellow he had better leave 
before you throw him out. 

i. You defend the Negro against some who 
complain of his presence. 

j You give the Negro friendly warning 

that perhaps he had better not come back. 


Roop 


The various overt responses are to be scaled 
in a manner analogous to the procedure for 
the statements of opinion. It is quite probable 
that these two types of scale, the opinion scale 
and the situation scale, will be highly corre- 
lated. 

We take for granted that people’s attitudes 
are subject to change. When we have measured 
a man's attitude on any issue such as pacifism, 


we shall not declare such a measurement to 
be in any sense an enduring or constitutional 
constant. His attitude may change, of course, 
from one day to the next, and it is our task 
to measure such changes, whether they be due 
to unknown causes or to the presence of some 
known persuasive factor, such as the reading 
of a discourse on the issue in question. How- 
ever, such fluctuations may also be attributed 
in part to error in the measurements them- 
selves. In order to isolate the errors of the 
measurement instrument from actual fluctua- 
tions in attitude, we must calculate the stan- 
dard error of measurement of the scale itself, 
and this can be accomplished by methods al- 
ready well known in mental measurement. 
We shall assume that an attitude scale is 
used only in those situations in which one may 
reasonably expect people to tell the truth about 
their convictions or opinions. If a denomina- 
tional school were to submit to its students a 
scale of attitude about the church, one might 
find that some students would hesitate to make 
known their convictions if they deviate from 
the orthodox beliefs of their school. At least, 
the findings could be challenged if the situation 
in which attitudes were expressed contained 
pressure or implied threat bearing directly on 
the attitude to be measured. Similarly, it is 
difficult to discover attitudes on sex liberty by 
a written questionnaire, because of the well- 
nigh universal pressure to conceal such atti- 
tudes when they deviate from supposed con- 
ventions. It is assumed that attitude scales will 
be used primarily in those situations that offer 
a minimum of pressure on the attitude to be 
measured. Such situations are common enough. 
However, it is sometimes of considerable 
interest to inquire what the distribution of 
attitude may be in a group which is known to 
be influenced by social pressure or taboo. If, 
for example, a group of college students are 
asked to express their attitude on the subject 
of sex liberty, the results might indicate con- 
formity with conventional standards. Such a 
result might be interpreted to mean that the 
students agree with conventional ideals in re- 
gard to sex liberty, or the results might be 
challenged as reflecting only the social taboo 
against deviations from the conventional stan- 
dards. If, on the other hand, the results should 
be a distribution of unconventional attitudes, 
the interpretation would be more conclusive 
in that the expressed attitude appears in spite 
of the known taboo. It goes without saying that 
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the distribution of attitude on any social issue 
and with any particular group must be inter- 
preted in terms of the known factors that may 
influence judgment. 

The question has been raised whether the 
concept of attitude as here used and as mea- 
sured by an attitude scale is not hypothetical 
rather than “real.” It is just as hypothetical as 
the concept of intelligence, which is measured 
by what it supposedly does. But these concepts 
are hypothetical in the same sense that the con- 
cepts force, momentum, volume, are hypothet- 
ical in physical science. No one has ever seen 
or touched a force or a momentum or a volume. 
They are measured by what they supposedly do. 
The legitimacy of these abstractions can be 
tested only in the consistency by which they 
operate in experience. Not infrequently these 
hypothetical entities are discarded either, first, 
because they lead to inconsistencies in experi- 
ence or, second, because they have to be multi- 
plied in number so that they become as 
numerous as the effects that they are intended 
to explain or facilitate in analysis. As long as 
biologists insisted on the definition of instincts 
in terms of overt acts, they found that the in- 
stincts had to be as numerous as the overt acts 
to be accounted for, and then the instinct ab- 
straction lost its usefulness. The instincts prob- 
ably could be defined more successfully in terms 
of the other end of the psychological act. The 
concept of intelligence is a useful though hypo- 
thetical entity. It is postulated that intelligence 
is that which is dynamically common to a large 
group of overt acts. The degree of this hypo- 
thetical power which we call intelligence is esti- 
mated in terms of overt performances, and the 
term is successful to the extent that different 
forms of adaptive overt performance are posi- 
tively correlated. The greater confidence with 
which we handle such a hypothetical entity as 
force is completely contained in the higher de- 
gree of consistency with which the hypothetical 
force is measured in different forms of its ex- 
pression. We are here dealing with a similar 
hypothetical entity, attitude toward a social or 
Psychological object. It is part of the conve- 
nience of language to speak of a force as though 
it were the “cause” of the movement of a par- 
ticle, and just so we speak of intelligence as the 
cause of conduct that is regarded as particularly 
adaptive, and we speak of attitude as the cause 
of that conduct which is favorable or unfavor- 
able toward a psychological object. 

The consistency by which the term “attitude” 


may be established as useful is to be found in 
the indorsements of different opinions. Con- 
sider these two opinions about the church: “I 
find the services of the church both restful and 
inspiring,” and “I think the church is a divine 
institution.” One of these statements concerns 
the effect of the church service on the individ- 
ual subject. The other expresses a belief in the 
divine character of the church. Considered ob- 
jectively and logically, there should be no neces- 
sary correspondence between these two state- 
ments. They are declarations about totally dif- 
ferent things, objectively regarded. But among 
all the people who indorse the first statement 
there will be a large proportion who also in- 
dorse the second statement. That which these 
two objectively entirely different statements 
have in common is the postulated favorable 
affect of the subject toward the psychological 
object, the church. To the extent that such con- 
sistencies in indorsements can be found, we are 
justified in postulating a common core and in 
naming it. 

An example of the opposite kind may serve 
better to illustrate the justification for postulat- 
ing the common factor of attitude. Consider 
the statement, “Going to church will not do 
anyone any harm.” You find by actually trying 
it that the pious people indorse this statement. 
They can hardly do otherwise. On the other 
hand, some of the hard-boiled atheists also in- 
dorse it if they don’t think that churchgoing is 
going to be specially harmful. Here you have 
people from both ends of the affective con- 
tinuum indorsing the same statement. If all of 
our opinions behaved that way upon statistical 
analysis, we would have no attitude scales, and 
we would not even be able to postulate the at- 
titude continuum. But fortunately most of the 
opinions that we now write behave much better 
than that one did. This criterion we have called 
the criterion of irrelevance. 

Perhaps the nature of the underlying concept 
of attitude can be finally best illustrated by an 
extreme example that I have found useful on 
several occasions. Suppose that you would all 
be given a list of statements about communism 
with the request that you check those state- 
ments that you think are true, and suppose 
that you were in a sufficiently docile mood to 
undertake the task. Some of the statements are 
very frankly favorable to communism, some are 
even extreme, others straddle the question with 
neutral assertions, others are derogatory, and 
some are bitter denunciations of the communists. 
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Now I shall venture the guess that the large 
majority of this audience would register strong 
denunciation of communism. Since these state- 
ments haye been scaled on an affect continuum, 
I could plot the frequency distribution of your 
attitudes toward communism, and it would 
probably be strongly skewed with a mode at 
the anti-end of the scale. Here you would have 
given me a clear indication that you react 
against this social object or symbol that has 
been placed before you. 

But now suppose that I should turn on you 
with the request that you all write out a state- 
ment of what it is that you have been talking 
about, just what these doctrines are that you 
have so universally denounced. I should then 
not be at all surprised if some of you would 
have difficulty in telling me just what the doc- 
trines of communism are toward which you 
have reacted so strongly. Those of you who 
would venture to write such statements would 
undoubtedly differ widely in what you say that 
you have expressed yourself about. Now the 
important point for the purpose of attitude 
measurement is that your vagueness in supply- 
ing cognitive detail does not in the least inval- 
idate your expression of attitude. Even though 
you might differ in your detailed description of 
the communism symbol, you might all agree 
that it is bad, that those who subscribe to such 
a symbol should be sent to jail, and that such 
doctrines should be kept out of the country. We 
have here a clear registration of affective value, 
a strong negative affective valuation of a sym- 
bol, and that is all that we are trying to find 
when we measure social attitudes. Of course, it 
would be socially interesting and perhaps im- 
portant that we should get together on what it 
is that we feel so strongly about, but for the 
purposes of attitude measurement we have done 
our task when the positive or negative affect 
has been recorded in terms of the discriminal 
dispersion as a unit of measurement. 

I must make it clear that in discussing these 
various issues, such as religion, communism, 
birth control, municipal ownership, race preju- 
dice, pacifism, and so on, I am not advancing 
any doctrines whatever. I have personal convic- 
tions on some of these issues, and so does every- 
body else, but in these studies we are concerned 
merely with the description of the degree of af- 
fect for or against various social symbols by 
psychophysical methods. In giving each person 
a positive or negative score on a disputed social 


issue, we do not say anything whatever as to 
whether his attitude is good or bad, whether 
his attitude should be censured or encouraged. 
"That is a matter of interpretation in each issue, 
and it is not the scientific problem with which 
we are concerned. 

In closing, I shall mention only one other 
question that has appeared on several occasions. 
It has been suggested that the attitude scales 
might be used in order to eliminate undesirable 
students from colleges and universities. They 
might be given an attitude scale on patriotism 
or on religion or on something else that is sup- 
posed to tell whether a person is desirable or 
not from any particular point of view. In the 
first place, you would immediately make liars 
of many applicants who differ with you in their 
political or religious convictions on the issue in 
question. We have not yet combined the atti- 
tude scales with the lie detector, although such 
experiments are contemplated. But even if it 
were possible to ascertain the political and re- 
ligious attitudes of people under conditions 
which would detect when they are lying, it 
would be a vicious policy for any educational 
institution to adopt. 

It has been proposed that attitude scales 
might be used to determine whether a course 
of instruction in social science has had the de- 
sired effect. To be sure, one of the important 
results of social science instruction is change in 
social attitude. But to make the passing of a 
course contingent on taking the so-called right 
attitude on any particular social issue would be 
preposterous. I am unalterably opposed to any 
such policy for judging progress in social sci- 
ence courses. 

I have reviewed some of our attempts to ex- 
tend the experimental methods and the logic of 
psychophysics beyond the field of sensory dis- 
crimination to which it has been limited by 
psychological tradition. It has been stated by 
economists and by other social scientists that 
affect cannot be measured, and some of the fun- 
damental theory of social science has been writ- 
ten with this explicit reservation. Our studies 
have shown that affect can be measured. In ex- 
tending the methods of psychophysics to the 
measurement of affect, we seem to see the pos- 
sibility of a wide field of application by which 
it will be possible to apply the methods of quan- 
titative scientific thinking to the study of feel- 
ing and emotion, to aesthetics, and to social 
phenomena. 


om 
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3. Attitudes versus Actions 


RICHARD T. LAPIERE 


By definition, a social attitude is a behaviour 
pattern, anticipatory set or tendency, predispo- 
sition to specific adjustment to designated social 
situations, or, more simply, a conditioned re- 
sponse to social stimuli.! Terminological usage 
differs, but students who have concerned them- 
selves with attitudes apparently agree that they 
are acquired out of social experience and pro- 
vide the individual organism with some degree 
of preparation to adjust, in a well-defined way, 
to certain types of social situations if and when 
these situations arise. It would seem, therefore, 
that the totality of the social attitudes of a sin- 
gle individual would include all his socially ac- 
1 quired personality which is involved in the 
. making of adjustments to other human beings. 
- But by derivation social attitudes are seldom 
more than a verbal response to a symbolic situ- 
ation. For the conventional method of measur- 
ing social attitudes is to ask questions (usually 
in writing) which demand a verbal adjustment 
to an entirely symbolic situation. Because it is 
easy, cheap, and mechanical, the attitudinal 
questionnaire is rapidly becoming a major 
method of sociological and socio-psychological 
investigation. The technique is simple. Thus 
from a hundred or a thousand responses to the 
question "Would you get up to give an Arme- 
nian woman your seat in a street car?" the in- 
vestigator derives the "attitude" of non-Arme- 
nian males towards Armenian females. Now the 
question may be constructed with elaborate skill 
and hidden with consummate cunning in a maze 
of supplementary or even irrelevant questions 
yet all that has been obtained is a symbolic re- 
sponse to a symbolic situation. The words “Ar- 


° Reprinted from Social Forces, 1934, 13, 230-237, 
with permission of the author and the publisher. 


1 See Daniel D. Droba, “Topical Summaries of 
Current Literature,” The American Journal of 


Sociology, 1934, p. 513. 
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menian woman” do not constitute an Armenian 
woman of flesh and blood, who might be tall 
or squat, fat or thin, old or young, well or 
poorly dressed—who might, in fact, be a god- 
dess or just another old and dirty hag. And the 
questionnaire response, whether it be “yes” or 
“no,” is but a verbal reaction and this does not 
involve rising from the seat or stolidly avoiding 
the hurt eyes of the hypothetical woman and 
the derogatory stares of other street-car occu- 
pants. Yet, ignoring these limitations, the dili- 
gent investigator will jump briskly from his 
factual evidence to the unwarranted conclusion 
that he has measured the “anticipatory behav- 
ior patterns” of non-Armenian males towards 
Armenian females encountered on street cars. 
Usually he does not stop here, but proceeds to 
deduce certain general conclusions regarding 
the social relationships between Armenians and 
non-Armenians. Most of us have applied the 
questionnaire technique with greater caution, 
but not I fear with any greater certainty of 
success, 

Some years ago I endeavored to obtain com- 
parative data on the degree of French and En- 
glish antipathy towards dark-skinned peoples.? 
The informal questionnaire technique was used, 
but, although the responses so obtained were 
exceedingly consistent, I supplemented them 
with what I then considered an index to overt 
behavior. The hypothesis as then stated seemed 
entirely logical: 


Whatever our attitude on the validity of “ver- 
balization" may be, it must be recognized that 
any study of attitudes through direct question- 
ing is open to serious objection, both because 
of the limitations of the sampling method and 
because in classifying attitudes the inaccuracy 


2 "Race Prejudice: France and England," Social 
Forces, September, 1928, pp. 102-111. 


of human judgment is an inevitable variable. 
In this study, however, there is corroborating 
evidence on these attitudes in the policies 
adopted by hotel proprietors. Nothing could be 
used as a more accurate index of color preju- 
dice than the admission or non-admission of 
colored people to hotels. For the proprietor 
must reflect the group attitude in his policy re- 
gardless of his own feelings in the matter. Since 
he determines what the group attitude is to- 
wards Negroes through the expression of that 
attitude in overt behavior and over a long pe- 
riod of actual experience, the results will be 
exceptionally free from those disturbing factors 
which inevitably affect the effort to study atti- 
tudes by direct questioning. 


But at that time I overlooked the fact that 
what I was obtaining from the hotel proprietors 
was still a "verbalized" reaction to a symbolic 
situation. The response to a Negro's request for 
lodgings might have been an excellent index of 
the attitude of hotel patrons towards living in 
the same hotel as a Negro. Yet to ask the pro- 
prietor “Do you permit members of the Negro 
race to stay here?" does not, it appears, measure 
his potential response to an actual Negro. 

All measurement of attitudes by the question- 
naire technique proceeds on the assumption 
that there is a mechanical relationship between 
symbolic and non-symbolic behavior. It is sim- 
ple enough to prove that there is no necessary 
correlation between speech and action, between 
response to words and to the realities they sym- 
bolize. A parrot can be taught to swear, a child 
to sing "Frankie and Johnny" in the Mae West 
manner. The words will have no meaning to 
either child or parrot. But to prove that there 
is no necessary relationship does not prove that 
such a relationship may not exist. There need 
be no relationship between what the hotel pro- 
prietor says he will do and what he actually 
does when confronted with a colored patron. 
Yet there may be. Certainly we are justified in 
assuming that the verbal response of the hotel 
proprietor would be more likely to indicate 
what he would actually do than would the 
verbal response of people whose personal feel- 
ings are less subordinated to economic expedi- 
ency. However, the following study indicates 
that the reliability of even such responses is 
very small indeed. 

Beginning in 1930 and continuing for two 
years thereafter, I had the good fortune to 
travel rather extensively with a young Chinese 
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student and his wife? Both were personable, 
charming, and quick to win the admiration and 
respect of those they had the opportunity to 
become intimate with. But they were foreign- 
born Chinese, a fact that could not be disguised. 
Knowing the general "attitude" of Americans 
towards the Chinese as indicated by the "social 
distance" studies which have been made, it was 
with considerable trepidation that I first ap- 
proached a hotel clerk in their company. Per- 
haps that clerk's eyebrows lifted slightly, but he 
accommodated us without a show of hesitation. 
And this in the "best" hotel in a small town 
noted for its narrow and bigoted “attitude” to- 
wards Orientals. Two months later I passed 
that way again, phoned the hotel and asked if 
they would accommodate "an important Chi- 
nese gentleman." The reply was an unequivocal 
“No.” That aroused my curiosity and led to this 
study. 

In something like ten thousand miles of mo- 
tor travel, twice across the United States, up 
and down the Pacific Coast, we met definite re- 
jection from those asked to serve us just once. 
We were received at 66 hotels, auto camps, and 
“Tourist Homes,” refused at one. We were 
served in 184 restaurants and cafes scattered 
throughout the country and treated with what 
I judged to be more than ordinary considera- 
tion in 72 of them. Accurate and detailed rec- 
ords were kept of all these instances. An effort, 
necessarily subjective, was made to evaluate the 
overt response of hotel clerks, bell boys, eleva- 
tor operators, and waitresses to the presence of 
my Chinese friends. The factors entering into 
the situations were varied as far and as often 
as possible. Control was not, of course, as exact- 
ing as that required by laboratory experimen- 
tation. But it was as rigid as is humanly possi- 
ble in human situations. For example, I did 
not take the “test” subjects into my confidence 
fearing that their behavior might become self- 
conscious and thus abnormally affect the 
response of others towards them. Whenever pos- 
sible I let my Chinese friend negotiate for ac- 
commodations (while I concerned myself with 
the car or luggage) or sent them into a restau- 
rant ahead of me. In this way I attempted to 
“factor” myself out. We sometimes patronized 
high-class establishments after a hard and dusty 
day on the road and stopped at inferior auto 
camps when in our most presentable condition. 


3 The results of this study have been withheld 
until the present time out of consideration for 
their feelings. 
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In the end I was forced to conclude that 
those factors which most influenced the behav- 
ior of others towards the Chinese had nothing 
at all to do with race. Quality and condition of 
clothing, appearance of baggage (by which, it 
seems, hotel clerks are prone to base their quick 
evaluations), cleanliness and neatness were far 
more significant for person to person reaction 
in the situations I was studying than skin pig- 
mentation, straight black hair, slanting eyes, 
and flat noses. And yet an air of self-confidence 
might entirely offset the “unfavorable” impres- 
sion made by dusty clothes and the usual dis- 
order to appearance consequent upon some 
hundred miles of motor travel. A supercilious 
desk clerk in a hotel of noble aspirations could 
not refuse his master’s hospitality to people who 
appeared to take their request as a perfectly 
normal and conventional thing, though they 
might look like tin-car tourists and two of them 
belong to the racial category “Oriental.” On 
the other hand, I became rather adept at ap- 
proaching hotel clerks with that peculiar crab- 
wise manner which is so effective in provoking 
a somewhat scornful disregard. And then a 
bland smile would serve to reverse the entire 
situation. Indeed, it appeared that a genial 
smile was the most effective password to accep- 
tance. My Chinese friends were skillful smilers, 
which may account, in part, for the fact that 
we received but one rebuff in all our experi- 
ence. Finally, I was impressed with the fact that 
even where some tension developed due to the 
strangeness of the Chinese it would evaporate 
immediately when they spoke in unaccented 
English, 

The one instance in which we were refused 
accommodations is worth recording here. The 
place was a small California town, a rather in- 
ferior auto-camp into which we drove in a very 
dilapidated car piled with camp equipment. It 
was early evening, the light so dim that the 
Proprietor found it somewhat difficult to decide 
the genus voyageur to which we belonged. I 
left the car and spoke to him. He hesitated, 
wavered, said he was not sure that he had two 
cabins, meanwhile edging towards our car. The 
realization that the two occupants were Orien- 
tals turned the balance or, more likely, gave 
him the excuse he was looking for. "No," he 
said, “I don’t take Japs!” In a more pretentious 
establishment we secured accommodations, and 
with an extra flourish of hospitality. 

To offset this one flat refusal were the many 
instances in which the physical peculiarities of 


the Chinese served to heighten curiosity. With 
few exceptions this curiosity was considerately 
hidden behind an exceptional interest in serv- 
ing us. Of course, outside of the Pacific Coast 
region, New York, and Chicago, the Chinese 
physiognomy attracts attention. It is different, 
hence noticeable. But the principal effect this 
curiosity has upon the behavior of those who. 
cater to the traveler's needs is to make them 
more attentive, more responsive, more reliable. 
A Chinese companion is to be recommended to 
the white traveling in his native land. Strange 
features when combined with "human" speech 
and action seems, at times, to heighten sympa- 
thetic response, perhaps on the same principle 
that makes us uncommonly sympathetic towards 
the dog that has a “human” expression in his 
face. 

What I am trying to say is that in only one 
out of 251 instances in which we purchased 
goods or services necessitating intimate human 
relationships did the fact that my companions 
were Chinese adversely affect us. Factors entirely 
unassociated with race were, in the main, the 
determinant of significant variations in our re- 
ception. It would appear reasonable to con- 
clude that the "attitude" of the American peo- 
ple, as reflected in the behavior of those who 
are for pecuniary reasons presumably most sen- 
sitive to the antipathies of their white clientele, 
is anything but negative towards the Chinese. 
In terms of "social distance" we might con- 
clude that native Caucasians are not averse to 
residing in the same hotels, auto-camps, and 
"Tourist Homes" as Chinese and will with 
complacency accept the presence of Chinese at 
an adjoining table in restaurant or cafe. It does 
not follow that there is revealed a distinctly 
"positive" attitude towards the Chinese, that 
whites prefer the Chinese to other whites. But 
the facts as gathered certainly preclude the con- 
clusion that there is an intense prejudice to- 
wards the Chinese. 

Yet the existence of this prejudice, very in- 
tense, is proven by a conventional "attitude" 
study. To provide a comparison of symbolic re- 
action to symbolic social situations with actual 
reaction to real social situations, I "question- 
naired" the establishments which we patron- 
ized during the two year period. Six months 
were permitted to lapse between the time I 
obtained the overt reaction and the symbolic. 
It was hoped that the effects of the actual ex- 
perience with Chinese guests, adverse or other- 
wise, would have faded during the intervening 


time. To the hotel or restaurant a questionnaire 
was mailed with an accompanying letter pur- 
porting to be a special and personal plea for 
response. The questionnaires all asked the same 
question, “Will you accept members of the Chi- 
nese race as guests in your establishment?” Two 
types of questionnaire were used. In one this 
question was inserted among similar queries 
concerning Germans, French, Japanese, Rus- 
sians, Armenians, Jews, Negroes, Italians, and 
Indians. In the other the pertinent question 
was unencumbered. With persistence, completed 
replies were obtained from 128 of the establish- 
ments we had visited; 81 restaurants and cafes 
and 47 hotels, auto-camps, and “Tourist 
Homes.” In response to the relevant question 
92 per cent of the former and 91 per cent of the 
latter replied “No.” The remainder replied 
“Uncertain; depend upon circumstances.” From 
the woman proprietor of a small auto-camp I 
received the only “Yes,” accompanied by a 
chatty letter describing the nice visit she had 
had with a Chinese gentleman and his sweet 
wife during the previous summer. 

A rather unflattering interpretation might be 
put upon the fact that those establishments who 
had provided for our needs so graciously were, 
some months later, verbally antagonistic to- 
wards hypothetical Chinese. To factor this ex- 
perience out responses were secured from 32 
hotels and 96 restaurants located in approxi- 
mately the same regions, but uninfluenced by 
this particular experience with Oriental clients. 
In this, as in the former case, both types of 
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questionnaires were used. The results indicate 
that neither the type of questionnaire nor the 
fact of previous experience had important bear- 
ing upon the symbolic response to symbolic 
social situations. 

It is impossible to make direct comparison 
between the reactions secured through ques- 
tionnaires and from actual experience. On the 
basis of the above data (see Table 1) it would 
appear foolhardy for a Chinese to attempt to 
travel in the United States. And yet, as I have 
shown, actual experience (Table 2) indicates 
that the American people, as represented by 
the personnel of hotels, restaurants, etc., are not 
at all averse to fraternizing with Chinese within 
the limitations which apply to social relation- 
ships between Americans themselves. The eval- 
uations which follow are undoubtedly subject 
to the criticism which any human judgment 
must withstand. But the fact is that, although 
they began their travels in this country with 
considerable trepidation, my Chinese friends 
soon lost all fear that they might receive a re- 
buff. At first somewhat timid and considerably 
dependent upon me for guidance and support, 
they came in time to feel fully self-reliant and 
would approach new social situations without 
the slightest hesitation. 

The conventional questionnaire undoubtedly 
has significant value for the measurement of 
“political attitudes." The presidential polls 
conducted by the Literary Digest have proven 
that. But a “political attitude” is exactly what 
the questionnaire can be justly held to measure; 


TABLE 1. Distribution of Results from Questionnaire Study of Establishment 
“Policy” Regarding Acceptance of Chinese as Guests 
Replies are to the question: “Will you accept members of the 


Chinese race as guests in your establishment?” 


Hotels, etc., 
Visited 
Total 47 


Number replying 22 25 
No 
Undecided: depend 


upon circumstances 1 2 
Yes 1 0 


Restaurants, 
Hotels, etc., Restaurants, etc., 

not Visited etc., Visited not Visited 
32 81 96 

2 1 2 1 2 

12 43 38 51 45 

11 40 35 37 41 

1 3 3 4 3 

0 0 0 0 1 


e ee 
* Column (1) indicates in each case those responses to questionnaires which concerned Chinese 


only. The figures in columns (2) are from the questionnaires in which the above was inserted 
among questions regarding Germans, French, Japanese, etc. 
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TABLE 2. Distribution of Results Obtained from Actual Experience in the 


Situation Symbolized in the Questionnaire Study 


Conditions 
Total 


Reception very much better than investi- 
gator would expect to have received 
had he been alone, but under other- 
wise similar circumstances 

Reception different only to extent of 
heightened curiosity, such as investiga- 
tor might have incurred were he alone 
but dressed in manner unconventional 
to region yet not incongruous 

Reception "normal" 

Reception perceptibly hesitant and not to 
be explained on other than "racial" 
grounds 

Reception definitely, though temporarily, 
embarrassing 

Not accepted 


Hotels, etc. Restaurants, etc. 
Chinese Chinese 
not so not so 

Accom- accom- Accom- accom- 
panied panied panied panied 
by at incep- by at incep- 
investi- tion of investi- tion of 
gator situation gator situation 
55 12 165 19 
19 6 63 9 
22 3 76 6 

9 21 3 

8 1 4 1 

1 0 1 0 


* When the investigator was not present at the inception of the situation the judgments were 
based upon what transpired after he joined the Chinese. Since intimately acquainted with them 
it is probable that errors in judgment were no more frequent under these conditions than when 
he was able to witness the inception as well as results of the situation. 


à verbal response to a symbolic situation. Few 
citizens are ever faced with the necessity of ad- 
justing themselves to the presence of the polit- 
ical leaders whom, periodically, they must vote 
for—or against. Especially is this true with re- 
gard to the president, and it is in relation to 
political attitudes towards presidential candi- 
dates that we have our best evidence. But while 
the questionnaire may indicate what the voter 
Will do when he goes to vote, it does not and 
cannot reveal what he will do when he meets 
Candidate Jones on the street, in his office, at 
his club, on the golf course, or wherever two 
men may meet and adjust in some way one to 
the other. 

The questionnaire is probably our only 
means of determining "religious attitudes." An 
honest answer to the question "Do you believe 
in God?" reveals all there is to be measured. 


"God" is a symbol; "belief" a verbal expression. 
So here, too, the questionnaire is efficacious. 
But if we would know the emotional respon- 
siveness of a person to the spoken or written 
word "God" some other method of investiga- 
tion must be used. And if we would know the 
extent to which that responsiveness restrains 
his behavior it is to his behavior that we must 
look, not to his questionnaire response. Ethical 
precepts are, I judge, something more than 
verbal professions. "There would seem little to 
be gained from asking a man if his religious 
faith prevents him from committing sin. Of 
course it does—on paper. But “moral attitudes" 
must have a significance in the adjustment to 
actual situations or they are not worth the 
studying. Sitting at my desk in California I can 
predict with a high degree of certainty what an 
“average” business man in an average Mid- 


— ^ 


Western city will reply to the question "Would 
you engage in sexual intercourse with a prosti- 
tute in a Paris brothel?" Yet no one, least of all 
the man himself, can predict what he would 
actually do should he by some misfortune find 
himself face to face with the situation in ques- 
tion. His moral "attitudes" are no doubt al- 
ready stamped into his personality. But just 
what those habits are which will be invoked to 
provide him with some sort of adjustment to 
this situation is quite indeterminate. 

It is highly probable that when the "Southern 
Gentleman" says he will not permit Negroes to 
reside in his neighborhood we have a verbal 
response to a symbolic situation which reflects 
the "attitudes" which would become operative 
in an actual situation. But there is no need to 
ask such a question of the true "Southern Gen- 
tleman." We knew it all the time. I am in- 
clined to think that in most instances where 
the questionnaire does reveal non-symbolic at- 
titudes the case is much the same. It is only 
when we cannot easily observe what people do 
in certain types of situations that the question- 
naire is resorted to. But it is just here that the 
danger in the questionnaire technique arises. 
If Mr. A adjusts himself to Mr. B in a specified 
way we can deduce from his behavior that he 
has a certain "attitude" towards Mr. B and, per- 
haps, all of Mr. B's class. But if no such overt 
adjustment is made it is impossible to discover 
what A's adjustment would be should the situ- 
ation arise, A questionnaire will reveal what 
Mr. A writes or says when confronted with a 
certain combination of words. But not what he 
will do when he meets Mr. B. Mr. B is a great 
deal more than a series of words. He is a man 
and he acts. His action is not necessarily what 
Mr. A "imagines" it will be when he reacts 
verbally to the symbol "Mr. B." 

No doubt a considerable part of the data 
which the social scientist deals with can be ob- 
tained by the questionnaire method. The cen- 
sus reports are based upon verbal questionnaires 
and I do not doubt their basic integrity. If we 
Wish to know how many children a man has, 
his income, the size of his home, his age, and 
the condition of his parents, we can reasonably 
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ask him. These things he has frequently and 
conventionally converted into verbal responses. 
He is competent to report upon them, and will 
do so accurately, unless indeed he wishes to do 
otherwise. A careful investigator could no doubt 
even find out by verbal means whether the man 
fights with his wife (frequently, infrequently, 
or not at all), though the neighbors would be 
a more reliable source. But we should not ex- 
pect to obtain by the questionnaire method his 
"anticipatory set or tendency" to action should 
his wife pack up and go home to Mother, 
should Elder Son get into trouble with the 
neighbor's daughter, the President assume the 
status of a dictator, the Japanese take over the 
rest of China, or a Chinese gentleman come to 
pay a social call. 

Only a verbal reaction to an entirely sym- 
bolic situation can be secured by the question- 
naire. It may indicate what the responder 
would actually do when confronted with the 
situation symbolized in the question, but there 
is no assurance that it will. And so to call the 
response a reflection of a “social attitude" is to 
entirely disregard the definition commonly 
given for the phrase "attitude." If social atti- 
tudes are to be conceptualized as partially in- 
tegrated habit sets which will become operative 
under specific circumstances and lead to a par- 
ticular pattern of adjustment they must, in the 
main, be derived from a study of humans be- 
having in actual social situations. They must 
not be imputed on the basis of questionnaire 
data. 

The questionnaire is cheap, easy, and me- 
chanical. The study of human behavior is time 
consuming, intellectually fatiguing, and de- 
pends for its success upon the ability of the 
investigator. The former method gives quan- 
titative results, the latter mainly qualitative, 
Quantitative measurements are quantitatively 
accurate; qualitative evaluations are always sub- 
ject to the errors of human judgment. Yet it 
would seem far more worth while to make a 
shrewd guess regarding that which is essential 
than to accurately measure that which is likely 
to prove quite irrelevant. 


4. Verbal Stereotypes and Racial Prejudice 


DANIEL KATZ and KENNETH W. BRALY 


One outstanding result of investigations of ra- 
cial prejudice is the uniformity in the patterns 
of discrimination against various races! shown 
by Americans throughout the United States. 
People in widely separated parts of the country 
show a high degree of agreement in their ex- 
pressions of relative liking or disliking of differ- 
ent “foreign” groups. 

In an early study Bogardus asked 110 busi- 
nessmen and schoolteachers about the degrees 
of social intimacy to which they were willing to 
admit certain ethnic groups. The degrees of so- 
cial distance employed were: to close kinship 
through marriage, to my club as personal 
chums, to my street as neighbors, to employ- 
ment in my occupation, to citizenship in my 
country, to my country as visitors only, and ex- 
clusion from my country. By weighting these 
seven classifications Bogardus obtained the fol- 
lowing preferential rating of 23 ethnic groups: 


Canadians . 22.51 
English . 22.35 
Scotch 20.91 
Iris aly 19.38 
French . 18.67 
Swedes 16.20 
Germans 14.95 
Spanish . 14.02 


* Adapted by the authors from "Racial Stereotypes 
of 100 College students," Journal of Abnormal and 
Social Psychology, 1933, 28, 280-290, and “Racial 
Prejudice and Racial Stereotypes,” ibid., 1935, 30, 
175-193, with permission of the American Psycho- 
logical Association, Inc. This article originally ap- 
peared in T. M. Newcomb and E. L. Hartley 
(Eds), Readings in Social Psychology, lst Edition, 
Holt, Rinehart and Winston, New York, 1947, and 
is reprinted with permission of the author, the 
m. and the American Psychological Asso- 
ation. 


1 The term race is here used in the popular, not 
the scientific, sense, and covers reference to racial, 
religious, and national groupings. 
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The Bogardus study was carried out on the 
Pacific Coast but studies made in other parts of 
the United States indicate the same pattern of 
preferences for the various groups. In the Mid- 
dle West, for example, Thurstone constructed a 
scale on the basis of the likes and dislikes of 239 
students. The resulting rank order and scale 
values for 21 ethnic groups follows: 


Arerio 0 4 ork LI ws 0.00 
English —1.84 
Scotch nt i eae. | a 
Dio. vw. MR Ns webs IN 
French . —2.46 
German —2.55 
Swede y —2.90 
South American —3.64 
Italian . —3.66 
Spanish . —8.79 
Jew . . —892 
Russian —4.10 
Pole. . —4.41 
Greek —4.62 
Armenian . —4.68 
Japanese —4.93 
Mexican . —5.10 
Chinese —5.30 
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Hindus . —5.85 
Turk —5.82 
Negro —5.86 


How is the agreement about “foreign” groups 
to be interpreted? The first possibility is that 
the foreign groups possess varying degrees of 
undesirable qualities upon which most Amer- 
icans base their preferential ratings. But it is 
obvious that there are wide individual differ- 
ences within any nationality group—that is, not 
all Englishmen are alike, nor are all French- 
men, nor are all Russians. It is also obvious 
that few Americans have had much opportunity 
to know a large number of people from the 
many nationalities they dislike. It is also highly 
probable that if we were basing our judgments 


wholly upon what we know from actual contact _ 


with individual Spaniards, we would have dif- 
fering impressions of what Spaniards are really 
like, because we would not all have met the 
same type of Spaniard. Hence a more valid in- 
terpretation of the agreement of Americans 
about foreign groups is that it represents the 
prejudgments or prejudices, absorbed from the 
stereotypes of our culture. 

Thus the preferential disliking reported by 
Bogardus and Thurstone may reflect attitudes 
toward race names and may not arise from 
animosity toward the specific qualities inherent 
in the real human beings bearing a given racial 
label. We have learned responses of varying de- 
grees of aversion or acceptance to racial names 
and where these tags can be readily applied to 
individuals, as they can in the case of the Negro 
because of his skin color, we respond to him 
not as a human being but as a personification 
of the symbol we have learned to look down 
upon. Walter Lippmann has called this type of 
belief a stereotype—by which is meant a fixed 
impression which conforms very little to the 
facts it pretends to represent and results from 
our defining first and observing second. 


THE PRESENT STUDY? 


To explore the nature of racial and national 
stereotypes more fully, the following procedures 
were employed: 


1. Twenty-five students were asked to list as 
Many specific characteristics or traits as were 
thought typical of the following ten groups: 
Germans, Italians, Irish, English, Negroes, Jews, 


2 This study was made in 1932. 


Americans, Chinese, Japanese. Turks. No traits 
were suggested to the students. This list was 
then supplemented by characteristics commonly 
reported in the literature. The result was a final 
check-list of 84 descriptive adjectives. 

2. One hundred Princeton undergraduates 
were then asked to select the traits from this 
prepared list of 84 adjectives to characterize the 
ten racial and national groups. Specific direc- 
tions used in the experiment follow in part: 


Read through the list of words on page one 
and select those which seem to you to be typical 
of the Germans. Write as many of these words 
in the following spaces as you think are neces- 
sary to characterize these people adequately. If 
you do not find proper words on page one for 
all the typical German characteristics, you may 
add those which you think necessary for an 
adequate description. 


This procedure was then repeated for other 
national and racial groups. When the student 
had finished this he was asked to go back over 
the ten lists of words which he had chosen and 
to mark the five words of each list which seemed 
most typical of the group in question. 

3. Another group of students was asked to 
rate the list of adjectives on the basis of the 
desirability of these traits in friends and asso- 
ciates. The students making this judgment had 
no knowledge that the characteristics were sup- 
posed to describe racial groups. The traits or 
adjectives were rated from 1 to 10 on the basis 
of their desirability. 

4. Still another group of students was asked 
to put in rank order the ten racial and national 
groups on the basis of preference for association 
with their members. The group which the sub- 
ject most preferred to associate with was placed 
first and the group with which he preferred to 
associate least was placed tenth or last. 


RESULTS 


Stereotyped Conceptions of Ten Ethnic Groups 

Table 1 presents the twelve characteristics 
most frequently assigned to the ten races by 
the 100 students. This table summarizes the 
traits which students rechecked as the five most 
typical characteristics of each race. 

The traits most frequently assigned to the 
Germans seem consistent with the popular ste- 
reotype to be found in newspapers and maga- 
zines. Their science, industry, ponderous and 


TABLE 1. The Twelve Traits Most Frequently Assigned to Each of Various 
Racial and National Groups by 100 Princeton Students 


Traits Checked, Num- Per- Traits Checked, Num- Per- 
Rank Order ber cent Rank Order 1 ber cent 
GERMANS Sly 
Scientifically-minded 78 78 Loyal to family ties 15 15 
Industrious 65 65 Persistent 13 13 
Stolid 44 44 Talkative 13 13 
Intelligent 32 32 | Aggressive i FA alia 4 
Methodical $1 31 Very religious 12 12 
Extremely nationalistic — 24 24 
Progressive 16 16 | AMERICANS 
Efficient 16 16 Industrious 48 48 
Jovial 15 15 Intelligent 47 47 
Musical 13 13 Materialistic 33 33 
Persistent 11 11 Ambitious 33 33 
Practical 11 11 Progressive A O 
Pleasure-loving 26 26 
ITALIANS Alert 98 93 
Artistic 535. 59 | Efficient 21 2 
Re HE ja Aggressive 20 20 
Duicktenipered a a Straightforward 19 19 
Musical ia 4s Practical 19 19 
HERMA Sportsmanlike 19 19 
Imaginative 30 30 
Very religious 21 21 NEGROES 
Talkarire 21 21 | Superstitious 84 s4 
Revengeful 17 17 Lazy 75 15 
Physically dirty 18 13 Happy-go-lucky 38 38 
Lazy 12 12 | ignor 
Unreliable 11 11 siam er Á" 
Musical 26 26 
ENGLISH Ostentatious 26 26 
Sportsmanlike 58 53 Very religious 24 24 
Intelligent 46 46 | Stupid 22 22 
Conventional $4 34 Physically dirty 17 17 
Tradition-loving $1 81 Naive it4114 
Conservative 30 30 | Slovenly 15mi13 
Reserved 29 29 Unreliable 12 12 
Sophisticated 27 27 
Courteous 21 21 IRISH N 
Honest 20 20 Pugnacious 45 45 
Industrious 18 18 | Quick-tempered 39 39 
Extremely nationalistic 18 18 Witty 38 38 
Humorless 17 17 Honest 32 32 
Very religious 29. . 29 
JEWS Industrious 21 21 
Shrewd 79 79 Extremely nationalistic 21 21 
Mercenary 49 49 | Superstitious 18 18 
Industrious 48 48 Quarrelsome 14 14 
Grasping 34 34 Imaginative 13 13 
Intelligent 29 29 Aggressive 13 13 
Ambitious 21 21 Stubborn 13 13 
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TABLE 1 (Continued) 


Traits Checked, Num- Per- 
Rank Order ber cent 

CHINESE 
Superstitious 84 à 85 
Sly 29 30 
Conservative 29 30 
Tradition-loving 26 27 
Loyal to family ties 22 n (25 
Industrious 18 19 
Meditative 18 19 
Reserved 17 17 
Very religious 15005 
Ignorant 15 15 
Deceitful 14 14 
Quiet 13 13 
JAPANESE 
Intelligent 45 48 
Industrious 43 46 
Progressive 94 25 
Shrewd 22 23 
Sly 
Quiet 


methodical manner, and intelligence were 
pointed out by over one fourth of the students. 
Scientifically-minded was the most frequently 
assigned characteristic, as many as 78 percent 
of the group ascribing this trait to the Germans. 

Italians received the common characterization 
of the hot-blooded Latin peoples: artistic, im- 
pulsive, quick-tempered, passionate, musical and 
imaginative. The greatest agreement was shown 
on the artistic qualities of the Italians with 53 
percent of the students concurring in this belief. 

The characteristics ascribed to the Negroes 
are somewhat similar to the picture of the 
Negro as furnished by the Saturday Evening 
Post: highly superstitious, lazy, happy-go-lucky, 
ignorant, musical, and ostentatious. The great- 
est degree of agreement for a single trait for 
any racial group was reached when 84 percent 
of the students voted the Negroes superstitious. 
Laziness was given as a typical characteristic by 
three fourths of the students, but the other 
traits mentioned above had much lower fre- 
quencies of endorsement. It may be noted in 
passing that for a northern college, Princeton 
draws heavily upon the South for her enroll- 
ment so that this characterization of Negroes 
is not exclusively a Northern description. 


Traits Checked, Num- Per- 
Rank Order ber cent 

Imitative 17 18 
Alert 16 17 
Suave 16 17 
Neat 16 17 
Treacherous 13 14 
Aggressive 13 14 
TURKS 

Cruel 47 54 
Very religious 26 30 
Treacherous 21 24 
Sensual 20 23 
Ignorant 15 17 
Physically dirty 15 17 
Deceitful 13 15 
Sly 12 14 
Quarrelsome 12 14 
Revengeful 12 14 
Conservative 12 14 
Superstitious 11 13 


In the case of the Irish no single trait of the 
84 presented could be agreed upon as a typical 
Irish characteristic by half the students. Forty- 
five percent, however, thought pugnacity typical 
and 39 per cent agreed upon quick-tempered. 
Witty, honest, very religious, industrious, and 
extremely nationalistic were the other adjec- 
tives selected by a fifth or more of the students. 

The characterization of the English savors 
more of the English “gentleman” than of the 
general stereotype of John Bull. The leading 
characteristic is sportsmanship with an endorse- 
ment from 53 percent of the students. Forty-six 
percent of the students favored intelligence as 
typical of the English, 34 percent convention- 
ality, 31 percent love of traditions, and 30 per- 
cent conservatism. Other adjectives were re- 
served, sophisticated, courteous, and honest. 

The qualities of the competitive business 
world are used to describe the Jews. They are 
pictured as shrewd, mercenary, industrious, 
grasping, ambitious, and sly. Fifteen percent of 
the students did include Jewish loyalty to fam- 
ily ties. The greatest agreement (79 percent) 
was shown for shrewdness. 

The traits ascribed to Americans show a 
certain objectivity on the part of the students 
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in describing themselves, for the description 
given is not greatly at variance with the stereo- 
type held by non-Americans. Americans are 
described as industrious, intelligent, material- 
istic, ambitious, progressive, and  pleasure- 
loving. As in the case of the Irish the degree 
of agreement on these traits is relatively low. 
Almost one half did assign industry and intel- 
ligence to Americans, and a third gave materi- 
alistic and ambitious as the most descriptive 
adjectives. 

Apparently the general stereotype for the 
Chinese among eastern college students is fairly 
indefinite, for the agreement on typical Chi- 
nese characteristics is not great. Three of the 
100 students could give no characteristics for 
the Chinese. Of the 97 who did respond 35 
percent thought the Chinese superstitious, 30 
percent thought them sly, 30 percent regarded 
them as conservative. The next most frequently 
ascribed traits were love of tradition, loyalty 
to family ties, industry, and meditation. 

The picture of the Japanese seems more clear- 
cut with some recognition of the westernization 
of Japan. Emphasis was placed upon intelli- 
gence, industry, progressiveness, shrewdness, 
slyness, and quietness. The Japanese are the 
only group in which intelligence leads the list 
as the most frequently assigned characteristic. 
Forty-eight percent of the students filling in 
this part of the questionnaire gave intelligence 
as a typical Japanese trait. 

"Thirteen students could select no character- 
istics for the Turks. Fifty-four percent of those 
responding gave cruelty. Other traits selected 
described the Turks as very religious, treach- 
erous, sensual, ignorant, physically dirty, de- 
ceitful, and sly. 


Preferential Ranking of the Ten Groups 


The adjectives used to described the ten 
groups are a rough index of the esteem in 
Which they are held. More precise measures 
were furnished (1) by the direct ranking of the 
ten racial and national names in order of pref- 
erence (Table 2), and (2) by the desirability 
of the typical traits attributed to the ten groups 
(Table 8). 

The scores in Table $ are the average total 
value of the traits assigned to the various races, 
computed as follows: For every race the aver- 
age rating of a trait was multiplied by the 
number of times it was assigned to that race. 
The ratings of all the traits assigned to one 
race were added and divided by the total 


TABLE 2. Average Rank Order of Ten Racial 
Groups: Preferential Ranking 


Nationality Average Rank Order 
Americans 1.15 
English 2.27 
Germans 3.42 
Irish 3.87 
Italians 5.64 
Japanese 5.78 
Jews 7.10 
Chinese 7.94 
Turks 8.52 
Negroes 9.35 


TABLE 3. The Ranking of Ten Races on the 
Basis of the Rating of Their Alleged 
Typical Traits by 65 Students 


Average Value of 


Nationality Assigned Traits 
Americans 6.77 
English 6.26 
Germans 6.02 
Japanese 5.89 
Irish 5.42 
Jews 4.96 
Chinese 4.52 
Italians 4.40 
Negroes 3.55 
Turks 3.05 


number of assignments of traits to that race. 
This division would have been unnecessary if 
all the 100 students in the original group as- 
signing traits had assigned five traits to every 
race. In some cases, however, a student made 
less than five assignments. 

When we compare the ranking of the ten 
groups on the basis of preference for associ- 
ation with their members with their standing 
based on the desirability of traits attributed 
to them, we find a few changes in relative 
placement. The Italians drop from fifth to 
eighth place; the Irish drop two places, while 
the Japanese move up two places; and the 
Jews, Chinese, and Negroes move up one place. 
In other words, the Italians are regarded more 
highly and the Japanese are held in lower 
esteem than the qualities imputed to them 
would justify. 

It also is true that the ethnic groups are 
bunched much more closely together on the 
scores based on assigned traits than on the pref- 


———— 
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erence ranking. The preference ranking ac- 
corded to Americans is five times as desirable 
as that accorded to the Japanese, but the differ- 
ence in rating Americans and Japanese on the 
basis of imputed characteristics is nowhere 
nearly as great. In part this is an artifact of 
our method, but in part it is due to the fact 
that prejudice exceeds the rationalization of 
undesirable racial characteristics. Nonetheless 
there is marked similarity between the relative 
ranking on the basis of preference for group 
names and the average scores representing an 
evaluation of typical traits. 

Thus racial prejudice is part of a general 
set of stereotypes of a high degree of consis- 
tency and is more than a single specific reaction 
to a race name. The student is prejudiced 
against the label Negro because to him it 
means a superstitious, ignorant, shiftless per- 
son of low social status. The whole attitude is 
more than a simple conditioned response to 
the race name: it is a pattern of rationaliza- 
tions organized around the racial label. 

This does not mean that the rationalized 
complex is justified by objective reality—that 
is, that Negroes really are the type of people 
described by the stereotype. In fact the clear- 
ness or vagueness of the stereotyped concep- 
tion bear little relation to the degree of preju- 
dice expressed against a group as determined 
by its preferential ranking. 


Relative Clearness and Consistency 
of Pattern of Stereotypes 

Table 4 shows the clearness of the stereo- 
types about the ten groups in terms of the 
degree of agreement in assigning typical char- 
acteristics to them. 


TABLE 4. The Least Number of Traits Which 
Must Be Taken to Include 50 Percent of the 
Possible Assignments for Each Race 


Races, Number of 
Rank Order Traits Required 
Negroes 4.6 
Germans 5.0 
Jews 5.5 
Italians 6.9 
English 7.0 
Irish 8.5 
Americans 8.8 
Japanese 10.9 
Chinese 12.0 
Turks 15.9 


——————————— 


"Table 4 lists the least number of traits which 
have to be included to find 50 percent of the 
500 possible votes cast by the 100 students in 
the case of every racial and national group. 
It will be remembered that each student was 
allowed to select 5 of the 84 traits presented 
and that there were 100 students. If there were 
perfect agreement, 2.5 traits would have re- 
ceived 50 percent of the votes. Perfect disa- 
greement or chance would mean that 42 traits 
would be necessary to give half of the votes. 
Table 4 shows that in the case of Negroes we 
can find 50 percent of the votes or selections 
of traits in 4.6 traits. The agreement here is 
very high and even in the case of the Turks 
where 15.9 traits must be included to give 50 
percent of the possible 500 assignments or 
selections the voting is far from a chance 
selection. 

Thus in Table 4 we have a comparison of 
the definiteness of the ten racial stereotypes. 
The most definite picture is that of the Negroes. 
'The Germans and the Jews also give consistent 
patterns of response, while the Chinese, Japa- 
nese, and Turks furnish the least clear cut 
stereotypes. : 

Though the belief in the undesirable quali- 
ties of a national group bolsters the prejudice 
against the group, it is not necessary to have 
a well worked out set of such rationalizations 
to obtain expressions of extreme prejudice. 
In fact Table 4 shows little relation between 
degree of disliking and the definiteness of the 
stereotyped picture. Negroes and Turks both 
are held in the lowest esteem, yet they repre- 
sent opposite extremes in sharpness of stereo- 
type. Students agreed among themselves most 
closely in characterizing Negroes and disagreed 
most in characterizing Turks. But they were in 
agreement in putting both groups at the bot- 
tom of the list as least desirable as companions 
or friends. 


SUMMARY 


l. Ten ethnic groups were placed in rank 
order by Princeton students on the basis of 
preference for association with their members. 
The preferential ranking was similar in its 
main outline to the results reported by investi- 
gators in all parts of the United States. Minor 
exceptions occurred in the case of the Jews and 
Japanese, who were placed somewhat, lower 
and higher, respectively, than in other studies. 

2. Students not only agreed in their prefer- 
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ential ranking of ethnic groups, but they also 
agreed in the types of characteristics attributed 
to these groups. In fact the conception of 
“foreign” groups is so stereotyped that it can- 
not be based upon actual contact with or direct 
knowledge of the groups in question. 

3. The clearness or definiteness of the stereo- 
typed picture is not related to the degree of 
prejudice. The greatest prejudice is expressed 
against Negroes and Turks. The stereotyped 
picture of the Negro is very clear-cut while 
that of the Turk is the vaguest of any of the 
ten groups included in the study. 


4. A list of 84 traits given as the typical 
characteristics of the ten nationalities by a 
group of students was rated by another group 
of students on the basis of their desirability in 
associates. From these ratings scores were as- 
signed to the ten nationalities, the relative 
weight of which agreed closely with the pref- 
erential ranking. Racial prejudice is thus a 
generalized set of stereotypes of a high degree 
of consistency which includes emotional re- 
sponses to race names, a belief in typical char- 
acteristics associated with race names, and all 
evaluation of such typical traits. 


5. Attitudes and Cognitive Organization 


FRITZ HEIDER 


Attitudes towards persons and causal unit 
formations influence each other. An attitude 
towards an event can alter the attitude towards 
the person who caused the event, and, if the 
attitudes towards a person and an event are 
similar, the event is easily ascribed to the 
person. A balanced configuration exists if the 
attitudes towards the parts of a causal unit are 
similar (1). 

It is tempting to generalize from this state- 
ment and to omit the restriction to causal unit 
formation. Do units in general interact with 
attitudes in a similar way? 

In trying out this hypothesis we shall under- 
stand by attitude the positive or negative rela- 
tionship of a person p to another person o, 
or to an impersonal entity x which may be a 
situation, an event, an idea, or a thing, etc. 
Examples are: to like, to love, to esteem, to 
value, and their opposites. A positive relation 
of this kind will be written L, a negative 
one ~ L. Thus, pLo means p likes, loves, or 
values o, or, expressed differently, o is positive 
for p. 

The relation "unit" will be written U. 
Examples are: similarity, proximity, causality, 
membership, possession, or belonging. pUx can 
mean, for instance, p owns x, or p made x; 
p~ Ux means p does not own x, etc. Other 
relations which, in many ways, seem to func- 
tion like units are: p is familiar with, used to, 
or knows well o or x, and p is in situation x. 
In lumping together all these relations we are, 
of course, aware of the dissimilarities between 
them. Only in a first approximation can they 
be treated as belonging to one class. 

The hypothesis may be stated in greater de- 
tail thus: (a) A balanced state exists if an entity 
has the same dynamic character in all possible 
respects (e.g., if p admires and at the same time 


* Reprinted from Journal of Psychology, 1946, 21, 
107-112, with permission of the author and the 
publisher. 


likes o); in other words, if pLo or p ~ Lo is 
true for all meanings of L. (We may anticipate 
here that the analogous statement for pUo does 
not seem to hold in a general way.) (b) A bal- 
anced state exists if all parts of a unit have 
the same dynamic character (i.e., if all are posi- 
tive, or all are negative), and if entities with 
different dynamic character are segregated from 
each other. If no balanced state exists, then 
Íorces towards this state will arise. Either the 
dynamic characters will change, or the unit 
relations will be changed through action or 
through cognitive reorganization. If a change 
is not possible, the state of imbalance will 
produce tension. 

The first part of the hypothesis refers to 
influence of dynamic relations or attitudes on 
each other. Since the different dynamic rela- 
tions are not included in each other logically 
(“p likes o" does not imply “p admires 0”), the 
same 0 or x can be positive in one respect and 
negative in another. An example in point is 
the conflict between duty and inclination. A 
tendency exists to make the different dynamic 
relations agree with each other by means of 
cognitive restructuring (excuses or rationaliza- 
tions). Another example would be the tendency 
to admire loved persons and to love admired 
persons. 

More numerous are the possibilities to which 
the second part of the hypothesis refers. They 
can be grouped according to the entities mak- 
ing up the configurations: (a) person and non- 
person (p, x); (b) two persons (p, 0); (c) two 
persons and a non-person (p, o, x); (d) three 
persons (p, o, q). Many of the examples seem: 
to substantiate the hypothesis. Examples which 
do not fit may eventually lead to greater in- 
sight into the nature of the dynamic characters 
and of the unit relations. All examples refer 
to p's life space. This is true even of oLp which 
therefore means: p thinks that o likes or ad- 
mires p. 

39 


40 HISTORICAL FOUNDATIONS 


(a) p and x. Since the own person (p) is 
usually positive, a balanced state will exist if 
p likes what he is united with in any way, or 
if he dislikes the x he is segregated from. The 
cases (pLx) + (pUx) and (p~ Lx) + (p ~ Ux) 
are balanced. Examples: p likes the things he 
made; p wants to own the things he likes; p 
values what he is accustomed to. 

(b) p and o. Analogously, the two balanced 
states for p and o will be: (pLo) + (pUo) and 
(p ~ Lo) + (p~ Uo). Examples: p likes his 
children, people similar to him; p is uneasy if 
he has to live with people he does not like; 
p tends to imitate admired persons; p likes to 
think that loved persons are similar to him. 

pUo is a symmetrical relation, i.e., pUo im- 
plies oUp. That they belong to a unit is true 
for p and o in the same way, though their róles 
in the unit may be different (for instance, if 
U is a causal unit). However, pLo is non- 
symmetrical since it does not imply oL. It is 
in line with the general hypothesis to assume 
that a balanced state exists if pLo and oLp 
(or p~ Lo and o ~ Lp) are true at the same 
time. Attraction or repulsion between p and o 
are then two-way affairs; the relation is in 
symmetrical harmony. pLo is a non-symmetri- 
cal relation logically, but psychologically it 
tends to become symmetrical. Examples: p 
wants to be loved by an admired o; p dislikes 
people who despise him. oLp is similar to pLo 
in its relation to Uo. Examples: p likes to 
meet people who, he is told, admire him. 

(©) b, o, and x. The combinations become 
more numerous with three entities making up 
the configurations. Only a few possibilities can 
be mentioned. We shall always give the bal- 
anced state in symbols before stating the exam- 
ples which refer to it. 

(pLo) + (pLx) + (oUx). Both o and x are 
positive and parts of a unit. Examples: p ad- 
mires clothes of loved 0; b wants to benefit his 
friend o; p likes to think that his friend bene- 
fits him. A seeming exception is the case of 
envy. If o owns x (oUx) and p likes x (pLx), 
p ~ Lo may often follow. This exception can 
be derived from the fact that ownership is a 
one-many relation. A person can own many 
things but each thing can, ordinarily, be owned 
only by one person. Therefore "o owns x" ex- 
cludes “p owns x,” or oUx implies p ~ Ux. 
Since pLx may tend toward pUx, conflict is 
introduced. 

Implications between unit relations often 
lead to conflict. Lewin's three cases of inner 
conflict rest on implications. Approach to a 


positive valence may imply withdrawal from 
another positive valence. Withdrawal from a 
negative valence may imply approach to an- 
other negative valence. Finally, approach to a 
positive valence may imply approach to a neg- 
ative valence if both are located in the same 
region. Analogously, one can talk of three cases 
of outer conflict between persons. pUx may 
imply o — Ux (for instance, if U means owner- 
ship), and if both want x, conflict (competi- 
tion) will arise. In the same way conflict ap- 
pears if p and o want to get away from x but 
only one of them can do so (if p — Ux implies 
oUx, and vice versa). Lastly, it may happen 
that p likes x and o hates it, but p and o have 
to move together (pUx implies oUx, eg., in 
marriage). They either can both have x, or 
both not have it. 

Trying out variations of the triad (pLo) 4- 
(pLx) + (oUx), we find that (pLo) + (pLx) + 
(oLx) also represents a balanced case. Exam- 
ples: p likes what his friend o likes; p likes 
people with the same attitudes. This case is 
not covered by the hypothesis unless we treat 
L as equivalent to U. Actually, in many cases 
the effects of L and U in these configurations 
seem to be the same. Furthermore, this case 
shows the psychological transitivity of the L 
relation. A relation R is transitive if aRb and 
bRc imply aRc. Thus, p tends to like x if pLo 
and oLx hold. As in the case of the symmetry 
of the pLo relation, we again have to stress 
the difference between logical and psychological 
aspect. Logically, L is not transitive but there 
exists a psychological tendency to make it tran- 
sitive when implications between U relations do 
not interfere with transitivity. The relation U, 
too, seems to be in this sense psychologically 
transitive. (pUo) + (oUx) can lead to pUx; p 
feels responsible for what people belonging to 
him do. 

"Taking into account these considerations, we 
can reformulate the hypothesis: (a) In the case 
of two entities, a balanced state exists if the 
relation between them is positive (or negative) 
in all respects, i.e., in regard to all meanings 
of L and U. (b) In the case of three entities, a 
balanced state exists if all three possible rela- 
tions are positive in all respects. 

The question arises whether, with a triad, 
one can make any generalizations about bal- 
anced cases with negative relations. For in- 
stance, (pLo) + (o~ Ux) + (p~ Lx) is bal- 
anced, Examples: p likes o because o got rid of 
something p dislikes. In this case two entities, 
p and o, are related positively to each other, 
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while both are related negatively to the third 
entity x. This holds generally: the triad of 
relations is in balance, if two relations are 
negative and one positive. This statement can 
be derived from the assumption that L and U 
are, in a balanced configuration, exchangeable, 
symmetrical, and transitive. L and U can then 
be treated as formally analogous to an identity 
relation. The “balanced” cases with three terms 
are for this relation: a=b, b=c, a—c; 
a=b, b#c, as* c; axb, bxc, axe. By 
substituting L or U for the identity sign one 
obtains the balanced cases for these relations, 
though the case with three negative relations 
does not seem to constitute a good psycho- 
logical balance, since it is too indetermined. 

Therefore, the second part of the hypothesis 
must be stated as follows: (b) In the case of 
three entities, a balanced state exists if all 
three relations are positive in all respects, or 
if two are negative and one positive. 

(pLo) + (oLx) + (pUx). Examples: p likes 
o because o admired p's action; p wants his 
friend o to like p’s production; p wants to do 
what his friends admire. 

(pUo) + (pLx) + (oLx). Examples: p wants 
his son to like what he likes; p likes x because 
his son likes it. 1 

(d) p, o, and q. Among the many possible 
cases we shall only consider one. (pLo) + 
(oLq) + (pLq) Examples: p wants his two 
friends to like each other. This example shows, 
as the parallel case with x instead of q, the 
psychological transitivity of the Z relation. 

However, the transitivity of the L relation 
is here restricted by implications between unit 
relations when L represents a one-one love 
relation. p does not want his girl friend o to 
fall in love with his boy friend q because oLq 
in this case implies o — Lp, which conflicts 
with pLo. Jealousy, as well as envy and com- 
petition, is derived from implications between 
unit relations. 

After this discussion of the different possi- 
bilities there are several more points worth 
mentioning which refer to examples of dif- 
ferent groups. One is the problem of self eval- 
uation. High self regard of p can be expressed 
by pLp, low self regard by p ~ Lp (though the 
two p's in these expressions are not strictly 
equivalent), All of the examples so far con- 
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sidered presupposed pLp. However, one also 
has to take into account the possibility of 
p —Lp. As to be expected, it plays a rôle con- 
trary to that of pLp. Examples: if p has low 
self regard he might reject a positive x as too 
good for him; if p has guilt feelings he will 
think he ought to be punished; if his friend 
admired his product he will think it only 
politeness. A negative action attributed to him- 
self will produce p — Lp, etc. 

The equivalence of the L and U relations 
seems to be limited by the fact that often the 
U relation is weaker than the L relation. One 
can assume, that pLx brings about pUx (p 
wants to have a thing he likes) more often 
than pUx produces pLx (p gets to like a thing 
which belongs to him). Again (pLo) + (oLx) 
usually will lead to pLx (transitivity), but 
(pUo) + (oUx) will not do so if there holds 
at the same time p — Lo. 

We saw that one can derive forces towards 
actions, or goals, from the configurations. It 
can also happen, that the choice of means to 
a goal is determined by these patterns. If p 
wants to produce oLx, and he knows that oLp 
holds, he can do so by demonstrating to o the 
relation pLx, because (oLp) + (pLx) will lead 
to oLx. If p wants to bring about oLp, and 
he knows that oLx holds, he can produce pUx, 
for instance, he will perform an act o approves 
of. 

An examination of the discussed examples 
suggests the conclusion that a good deal of 
inter-personal behavior and social perception 
is determined—or at least co-determined—by 
simple cognitive configurations. This fact also 
throws light on the problem of the under- 
standing of behavior. Students of this problem 
often mentioned the aspect of rationality which 
enters into it. Max Weber and others pointed 
out one kind of rationality in behavior, namely, 
the rationality of the meansend relation. 
Choosing the appropriate means to gain an 
end makes for a "good," a "rational" action, 
and we can understand it. In Lewin's concept 
of hodological space this kind of rationality is 
elaborated. However, understandable human 
behavior often is not of this sort, but is based 
on the simple configurations of U and L rela- 
tions. Since they determine both behavior and 
perception we can understand social behavior 
of this kind. 


Heider, F. Social perception and phenomenal causality, Psychol. Rev. 1944, 51, 358-374. 
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6. The Behavior of Attitudes 


LEONARD W. DOOB 


There is no question that the subject of atti- 
tude and attitude measurements is important 
in sociology and social psychology. Social scien- 
tists continue to discuss the nature of attitudes 
in articles like this, to conduct experiments 
which show that behavior is affected by atti- 
tude, and to measure attitudes for theoretical 
or practical purposes. The problem of what 
an attitude is and how it functions, neverthe- 
less, persists and—as many writers on attitudes 
likewise point out in their introductory para- 
graph—little explicit agreement is apparent 
in the published literature. 

The purpose of this paper is not to criticize 
other definitions or usages of the term but 
systematically, if partially, to relate the concept 
of attitude to what is known as behavior 
theory (10)! Almost all writers, no matter 
what their bias, agree that attitudes are 
learned. If this is so, then the learning, reten- 
tion, and decline of an attitude are no differ- 
ent from the learning of a skill a piece of 
prose, or a set of nonsense syllables; and they 
must also involve the problems of perception 
and motivation. 

Immediately it is necessary to raise and 
answer the question as to why a simple, com- 
monsense, ubiquitous concept like attitude 
should be translated into semi-technical jargon. 
"There are at least two answers to the question. 
The first and less important answer involves 
scientific methodology: it is thought desirable 
to bring as many terms as possible relating to 


* Excerpted from an article in Psychological Re- 
view, 1947, 54, 135-156, with permission of the 
author and the American Psychological Association. 


1 The writer is deeply grateful especially to Neal 
E. Miller as well as to Irvin L. Child, John Dollard, 
and Mark A. May for their constructive criticisms. 
He has promiscuously and deliberately borrowed 
some of their ideas and, as an insignificant token 
of his gratitude, herewith absolves them of any 
responsibility for the final product. 


a field of research (in this instance, human 
behavior) within one universe of discourse. 
Misunderstandings result from using one set 
of concepts to describe perception, learning, 
and motivation on one level and another set 
on a different level. Even if some sociologists 
who use the concept "attitude" are not at- 
tracted by the terminology of behavior theory 
and if they remain inclined merely to assume 
perception, learning, and motivation without 
inquiring into the details of the processes, uni- 
fied knowledge concerning human behavior 
requires that there be a connection between 
what the sociologist studies or measures and 
what the psychologist studies or measures. Sec- 
ondly and of crucial importance, a tentative 
translation of a term from one level to another 
and into an already and perhaps better devel- 
oped theoretical system is fully justified if 
thereby inadequacies on the higher level can 
be pointed out. Murray and Morgan (16), for 
example, have recently shed new light on the 
nature of attitude by incorporating the closely 
related concept of sentiment into the former's 
previous conceptualization of personality. 

The procedure to be followed in this paper 
is as follows: 


1. A definition of attitude in behavioral 
terms will be given. It is felt that such a defi- 
nition represents an advance beyond the stage 
of defining an attitude as the subjective coun- 
terpart of something in the environment, as à 
predisposition within the organism, or as being 
what the attitude scale measures. The psycho- 
logical implications of the definition will be 
made clear. 

2. The consequences of this definition and 
the theoretical structure it assumes will then 
be summarized briefly by calling attention to 
the factors which should be known to make 
a completely adequate analysis of attitudes. 

3. Illustrative research employing the com- 
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cept of attitude will be critically examined in 
terms of these factors. Such a detailed exami- 
nation seems preferable to surveying the atti- 
tude literature in general terms. 


DEFINITION OF ATTITUDE 


Attitude is defined as an implicit, drive- 
producing response considered socially signifi- 
cant in the individual's society. This definition 
states, in effect, that from the psychological 
point of view attitude is an implicit response 
with drive strength which occurs within the 
individual as a reaction to stimulus patterns 
and which affects subsequent overt responses. 
Since psychologists and other social scientists 
sometimes disagree concerning the nature and 
attributes of an implicit response of this type, 
the definition is therefore elaborated and 
broken down typographically into the phrases 
and clauses requiring further definition, elabo- 
ration, and discussion: 

“An attitude is: 


(1) “an implicit response 

(2) “which is both (a) anticipatory and 
(b) mediating in reference to patterns of overt 
responses, * 

(3) "which is evoked (a) by a variety of 
stimulus patterns (b) as a result of previous 
learning or of gradients of generalization and 
discrimination, 

(4) "which is itself cue- and drive-produc- 
ing, 

(5) "and which is considered socially signifi- 
cant in the individual's society." 


1. “An attitude is an implicit response. . . - 
By an implicit response is meant a response 
occurring within the individual and not imme- 
diately observable to an outsider. Motor “atti- 
tudes” like the physical set of the runner be- 
fore the starter's gun is fired, therefore, are not 
included within the universe of discourse be- 
ing analyzed because they are not entirely im- 
plicit and hence are instrumental rather than 
mediating acts. The semantic connection of 
such sets with the concept of attitude employed 
by social scientists is considered to be largely 
fortuitous from an historical standpoint. 

Overt behavior that is observable to an out- 
sider may be affected by the evoked attitude 
but is not here defined as the attitude itself. 
Attitude refers to the individual's immediate 
but implicit response to a stimulus pattern 
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and his consequent tendency to respond still 
further as a result of that implicit response. 
Such an implicit response may be conscious or 
unconscious, distinctly verbal or vaguely pro- 
prioceptive. What is expressed results not from 
the attitude alone but represents, as will be 
indicated, another response in a behavior se- 
quence—an overt one—which is a function of 
the attitude-response and other tendencies 
within the individual. 

2(a). “An attitude is an implicit response 
which is ... anticipatory . . . in reference to 
patterns of overt responses. . . .” An anticipa- 
tory response—called also an antedating re- 
sponse by Hull (10, p. 74)—is one which orig- 
inally preceded another rewarded response and 
which, as a result of being associated with or 
producing this reward, has been reinforced so 
that it occurs before its "original time in the 
response series" (14, p. 49). If an individual, 
for example, dislikes a fruit or a person, he 
tends to avoid eating the fruit or meeting the 
person. Originally the avoidance occurred only 
after actual contact had been established and 
after that contact had proven to be punishing 
and the withdrawal to be rewarding. When a 
thorough investigation reveals no actual prior 
contact, some process of generalization or dis- 
crimination must have occurred since all be- 
havior has antecedents. The possibility of 
exceptions under different psychological con- 
ditions must be noted, even in these trivial 
illustrations: the fruit may be eaten anyhow if 
the individual is very hungry and the person 
may be met and greeted if social circumstances 
so require. 

The conditions under which a response 
moves up in the behavior sequence and be- 
comes anticipatory have so far been determined 
concretely in relatively simple situations. It 
has been shown (14, p. 79), for example, that 
the closer the response is in the series leading 
to the reward, the more likely it is to be 
learned and then subsequently to antedate 
other responses not leading to the reward. This 
principle suggests why few objects or indi- 
viduals in society fail to arouse attitudes. 
Originally the individual has had to react to 
them or has been taught to react to them in the 
course of being socialized. One of the responses 
leading to the goal response (which by defini- 
tion involves reward or the avoidance of pun- 
ishment) has implicit components, is reinforced, 
and is here called an attitude. 

2(b). “An attitude is an implicit response 
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which is . . . mediating in reference to patterns 
of overt responses. . . ." Whereas the anticipa- 
tory character of attitude indicates its temporal 
relation to a goal, its mediating attribute calls 
attention to its functional connection with that 
goal. A mediating response is made in an at- 
tempt to increase the likelihood of the occur- 
rence of reward rather than punishment in 
connection with a goal response. In reasoning, 
for example, implicit responses intervene be- 
tween the original stimulus pattern and the 
goal response and may assist the individual in 
achieving that goal. Attitudes can be evoked 
so easily because as mediating responses in- 
volving only language, imagery, or proprio- 
ceptive reactions they need not conflict with 
the overt behavior of the individual or with 
his environment. 

The mediating function of attitude has led 
May to suggest that attitude is “a kind of sub- 
stitute goal response" which "arises when the 
goal response cannot be immediately and easily 
made” (13). This attribute, which has also been 
suggested by other writers (e.g. 2, pp. 425-426 
and 17, pp. 28-29), does seem to characterize 
certain attitudes. The individual who dislikes 
another person is restrained or restrains him- 
self from hurting his antagonist; instead he 
makes an implicit response involving aggres- 
sion and feelings of avoidance or repulsion. 
It is felt, however, that all attitudes cannot 
be so characterized. The liked object, for 
example, evokes an implicit response which 
facilitates rather than acts as a substitute for 
overt behavior in reference to it. Overt be- 
havior, in short, may be mediated by attitudes 
almost immediately and there need not neces- 
sarily be a conflict or a restraint before the 
attitude is evoked. 

Three consequences arise immediately from 
this conceptualization of attitude. In the first 
place, it appears psychologically futile—as 
Sherif and Cantril have indicated (20, pp. 304- 
305)—to attempt to classify attitudes. Responses 
can be characterized in so many different ways 
that a simple dichotomy or trichotomy usually 
must be willfully stretched if it is to include 
all types of behavior. The response defined as 
attitude might be called positive or negative 
or be said to involve approach or withdrawal 
were it not for the fact that these terms then 
require further definition which cannot be 
consistently or usefully applied to all situations. 
Both approach and avoidance, for example, 
may be but are not necessarily involved in 


what has been called a neutral attitude or in 
any attitude for that matter. It seems better, 
in short, to apply socially useful labels to atti- 
tude as the need arises, in order to indicate the 
direction in which the individual thereby is 
oriented; but it should be clearly recognized 
that the labels have social and not psychologi- 
cal significance. A psychologically important 
distinction, however, is that between general 
and specific attitudes (3), a distinction which 
refers to the stimulus patterns evoking the 
response or to the evoked responses. 

In the second place, reference is made in the 
definition to "patterns of overt responses." For 
reasons hereafter suggested, overt behavior can 
seldom be predicted from knowledge of atti- 
tude alone. Under varying conditions within 
the individual, a given attitude can mediate a 
repertoire of overt responses. A favorable atti- 
tude toward a social institution, for example, 
can mediate innumerable responses connected 
with what is considered to be the welfare of 
that institution. 

Then, thirdly, this definition of attitude 
emphasizes its acquired or learned character. 
There are no psychic rays which enable the 
investigator, even though he be equipped with 
a poll or a scale, to determine the ‘strength’ 
of an attitude, the overt responses with which 
it has become associated, or its present func- 
tioning within the personality. Such knowledge 
can be obtained only from knowing approxi- 
mately under what conditions the attitude was 
acquired in the first place and the extent to 
which it secures present and future reinforce- 
ment. The learning process, therefore, is cru- 
cial to an understanding of the behavior of 
attitudes. 

The nature of that process cannot be ignored 
in a treatise on attitude as it has been by 
Sherif and Cantril: "Just what the psychologi- 
cal or physiological mechanisms of this learn- 
ing may be are irrelevant to the present dis- 
cussion,” they write in their articles on “The 
Psychology of 'Attitudes' " (20, p. 302). It is 
difficult to see how psychologists can call these 
mechanisms "irrelevant" and still contend, as 
the authors do in the introduction to the same 
articles, that the "task" of the psychologist is 
to "give an adequate account of the psycho- 
logical mechanisms involved in the formation 
of an attitude in any individual" (20, p. 295). 
The authors rest their case by stating that they 
“do not need to take sides in favor of any 
learning theory" (20, p. $07). Of course, as they 


write, “the primary stage in the formation of 
attitudes is a perceptual stage,” which is merely 
saying that there must be a stimulus and that 
the stimulus must be perceived before there 
can be any kind of learning. 

In contrast, Allport (1, pp. 810-812) has set 
the problem of the genesis of attitudes in 
terms which explicitly suggest the need for 
learning theory. His often quoted summary of 
the literature indicates that attitudes may be 
formed through “the integration of numerous 
specific responses of a similar type”; “indi- 
viduation, differentiation, or segregation" of 
experiences; "trauma"; and the adoption of 
"ready-made" attitudes of others. The genesis 
of almost any attitude is undoubtedly more or 
less unique. Society sets the rewards and pun- 
ishments regarding much of overt behavior; 
the individual being socialized then is forced 
into the learning situation; although he reacts 
to the situation uniquely, one of the end prod- 
ucts—the attitude—he may share in large 
degree with others. 

3(a). “An attitude is an implicit response 

. which is evoked by a variety of stimulus 
patterns. . . .’ The stimuli evoking an attitude 
may be in the external world or within the 
individual. The latter range from a verbal re- 
sponse to an autonomic disturbance or a drive. 
The existing literature on attitudes testifies to 
the fact that the stimuli may be various. Such 
is the assumption behind any attitude scale in 
which a variety of situations is judged or be- 
hind a distinction like that between specific 
and general attitudes. 

The arousal of an attitude involves two tra- 
ditional problems in psychology, those of per- 
ception and learning. The two are interrelated 
and can be separated only for purposes of 
analysis. Perception indicates that the indi- 
vidual is responding because he has previously 
paid attention to or been oriented toward cer- 
tain stimuli which then affect his sense organs 
and thus evoke his,attitude. Learning in this 
connection emphasizes the reasons in the past 
history of the individual which have brought 
about the bond between the stimulus pattern 
and the attitude. 

Gestalt psychologists especially point out 
that paying attention and then perceiving oc- 
cur in many instances as a result of the indi- 
vidual's set to respond: letter boxes are not 
noticed unless the person has a letter to post, 
etc. Writers on attitude are fond of recalling 
that the tradition of set is somewhat hoary in 


THE BEHAVIOR OF ATTITUDES 45 


academic psychology, in order to demonstrate 
their own respectability and their acquaintance 
with long German terms like Bewusstseinslage. 
In other words, perceiving depends upon drive 
or set (which orients the individual to respond 
to certain stimuli and then to respond to them 
in a particular way) as well as upon the ar- 
rangements of the external stimuli. Attitude 
may be included among the sets determining 
both the orientation of the individual as well 
as the kind of perceptual response he makes: 
The southerner notices incipient aggression in 
a Negro which a northerner will overlook. 

It must be recognized, however, that in some 
situations the individual's attitude is not 
evoked until the stimulus has been actually 
perceived: It affects perception after its arousal 
but does not orient him originally in the direc- 
tion of the stimuli involved. The southerner 
who has learned to discriminate incipient hos- 
tility from genuine docility among Negroes can 
make the discrimination when confronted with 
a Negro. It is on the basis of this discrimina- 
tion that his attitude toward the Negro is or 
is not aroused. If he goes about the streets 
looking for hostility in Negroes, he may be set 
to make the discrimination not necessarily by 
his attitude toward Negroes or by his ability 
to detect such behavior but by some particular 
drive which has been previously aroused and 
which may or may not involve Negroes. To say 
that he searches for aggression solely because 
of his attitude toward Negroes is to fail to 
distinguish him from another southerner who 
has a more or less identical attitude (so far as 
content and even “affect” are concerned) but 
who has no “chip on his shoulder.” In this 
case the sensible problem remains of account- 
ing for the "chip." 

In controlled experimental situations dedi- 
cated to observing the behavior of attitudes, 
the attention of the subjects is secured by the 
experimenter not through arousing the attitude 
being studied but through the evocation of 
some other drive. Subjects perceived the auto- 
kinetic phenomenon of Sherif (19) or they 
looked at Seeleman's photographs of white and 
colored individuals (18) because they had 
agreed to cooperate with the experimenter. 
This was the drive which oriented them in the 
experimental situation. Then their attitudes 
were aroused and these attitudes affected both 
their perceptual responses and the reports they 
gave the experimenters. Later on in the experi- 
ments, their evoked attitudes may have had 
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sufficient drive strength also to orient them 


selectively. 
3(b). “An attitude is an implicit response 
... which is evoked .. . as a result of previous 


learning or of gradients of generalization and 
discrimination. . . .” The previous section has 
indicated the possible relation between percep- 
tion and the evocation of an attitude and 
therein it was suggested that an attitude can 
almost always be aroused by a variety of 
stimuli. Here it is stated that previous learning 
determines whether or not particular stimulus 
patterns will evoke the attitude. Some stimuli 
come to arouse an attitude after a relatively 
simple process of conditioning has occurred. 
As a result of being originally present in the 
situation, for example, the wrapper of the dis- 
liked fruit or the signature of the disliked 
person may produce an anticipatory response 
which mediates the goal response of avoiding 
it or him. Other stimuli evoke or fail to evoke 
an attitude not because of their presence or 
absence in previous situations but because they 
fall along a stimulus gradient of generalization 
or discrimination. If an individual likes or 
dislikes one particular Negro, there is stimulus 
discrimination; if all Negroes are involved, 
there is stimulus generalization along a gradi- 
ent, for example, of skin color; if only certain 
“types” of Negroes arouse the attitude, there 
is generalization and discrimination. 
Frequently attitudes are thought to be puz- 
zling and mysterious because they can be 
aroused when the psychologist, the sociologist, 
the layman, and the individual least expect 
them to be, The stimulus pattern, for example, 
may be a word or a sentence, or some other 
symbol (like a flag, a face, or a gesture) which 
represents only a portion of the original stim- 
ulus pattern. The behavior of attitudes under 
these circumstances, however, becomes more 
intelligible when the implications of the 
“gradient of generalization and discrimina- 
tion" are understood. This gradient is espe- 
cially efficacious in the case of attitude since 
language can readily perform the function of 
mediating generalization and discrimination. 
Cofer and Foley (7), for example, are able 
a priori to list over fifty gradients along which 
generalization or discrimination from a single 
word can conceivably occur in purely formal 
(ie. in non-idiosyncratic) manner. The word 
"Negro," for example, through previous learn- 
ing may become the one part of the origi- 
nal stimulus pattern which evokes an attitude 


regarding Negroes. Without any difficulty there 
can be a semantic gradient including words 
like “colored,” “African,” and the various epi- 
thets and appellations applied to this, group. 
Through a slightly more complicated process, 
eventually "zoot suit" may arouse the same 
attitude: if the individual has associated this 
type of clothing with Negroes, the response of 
seeing such clothing or hearing its name evokes 
the internal response of "Negro" which in turn 
has the property of arousing the attitude. This 
phenomenon has been called secondary stimu- 
lus generalization or acquired cue value. Simi- 
larly, thinking can aid generalization and 
discrimination. 

Cantril in his work on attitudes fails to 
comprehend this stimulus gradient involved in 
most if not all learning and conditioning. He 
states, for example, that what he calls “a stan- 
dard of judgment” must not be “confused with 
a conditioned response” (5, p. 25). What he 
means by “a standard of judgment” or a “frame 
of reference" he never makes operationally 
clear, although they seem to be related to atti- 
tude since they are employed profusely in his 
two articles on the subject. It may be that 
terms like these—to use Cantril's own words 
in reference to "conditioning"—are “so loosely 
employed" that they "explain nothing at all" 
(5, p. 55). His conception of a conditioned re- 
sponse is indeed narrow: "a specific reaction 
to a specific stimulus" (5, p. 25). In spite of 
a general, noncommittal reference in a foot 
note to a standard book on conditioning (that 
of Hilgard and Marquis) and an irrelevant 
quotation from A. N. Whitehead, this simpli- 
fication of behavior theory leads him to con- 
clude that such a theory is "by no means ade- 
quate to explain the apparent meaningfulness 
of man's experience" (5, p. 56). More specifi- 
cally, for example, he maintains that the "anal- 
ysis of the backgrounds of the people who 
became panicky listening to the Hallowe'en 
broadcast [of Orson Welles in 1938] clearly 
shows that in no way had they been specifically 
conditioned against Martian invaders’ mowing 
down people on this planet” (5, p. 25). The 
point of Cantril’s study (4), however, is to dis- 
cover the previously existing attitudes and 
traits which made people prone to panic or 
sanity. After discovering many of them with 
admirable ingenuity, he gives a commonsense 
"explanation" of the behavior; i.e., he uses his 
particular vocabulary to describe what hap- 
pened (4, pp. 190-201). His failure to apply 


Chapter 8 of Hilgard and Marquis (8) to his 
own data prevents him from realizing that at 
the time of the broadcast people had learned 
to respond with anxiety to various stimuli as 
a result, for example, of the “war scare” asso- 
ciated with the Munich crisis (cf. 4, pp. 159- 
160). Certain Americans, some of whose socio- 
economic and psychological characteristics Can- 
tril has indicated, apparently generalized from 
these stimuli to a broadcast which of course 
they had not previously experienced but which 
nevertheless was sufficiently similar to other 
stimulus patterns previously evoking the anxi- 
ety. Such individuals “jumped the gun” by 
behaving—after their attitudes had been 
aroused by Mr. Welles and his associates— 
as they would have in the face of genuine 
catastrophe. 

From experiments in other fields of behavior, 
Hull states that the amplitude of a response 
"diminishes steadily with the increase in the 
extent of deviation . . . of the evocation stimu- 
lus . . . from the stimulus originally condi- 
tioned . . .” (10, p. 185) and that "the strength 
of the connections at other points of the zone 
can be determined only from a knowledge of 
the strength of the receptor-effector connection 

. at the point of reinforcement and the 
extent of the difference . . . between the posi- 
tion of the conditioned stimulus . . . and that 
of the evocation stimulus . . . on the stimulus 
continuum connecting them" (10, p. 187) 
Whether or not the principle holds true of 
attitudes has not been tested, so far as this 
writer knows. To test it, it would be necessary 
to know "the stimulus originally conditioned," 
the gradient along which the various "evoca- 
tion stimuli" are located, and whether or not 
in fact these latter stimuli have never been 
previously reinforced or extinguished. This 
could be done only by means of a careful life 
history of various individuals and by some 
Objective measurement of the strength of the 
attitude, 

At any rate, this generalization of Hull has 
an important bearing on the strength of atti- 
tude.(The "strength" of an attitude is almost 
as ambiguous as the concept of attitude itself. 
One type of strength. that seems important for 
predicting future behavior of the individual 
is the afferent-habit strength of the attitude, 
i.e., the strength of the bond between the stim- 
ulus pattern and the response which is here 
defined as attitude) In these terms the afferent- 
habit strength of an attitude is a function of 


THE BEHAVIOR OF ATTITUDES 47 


the number of previous reinforcements as well 
as the position along the gradient occupied 
by a particular stimulus pattern. 

4. “An attitude is an implicit response . . . 
which is itself cue- and drive-producing. . . .” 
Like all implicit responses, attitudes can be 
said to have stimulus-value, i.e, they arouse 
other responses. The responses they evoke may 
indeed be various. They may produce a per- 
ceiving response, as has been suggested, and 
in this sense determine to what other stimuli 
the individual subsequently responds. They 
may produce linguistic responses, thoughts, 
images, or stereotypes. They eventually have 
an effect upon overt behavior. Here again ref- 
erence must be made to the habit strength of 
the attitude, only this time it is a bond between 
attitude as a stimulus and a response pattern 
(implicit or overt), rather than between a stim- 
ulus pattern and attitude as a response. This 
phenomenon will be called the efferent-habit 
strength and, like the afferent-habit strength, 
it too can be strengthened or weakened through 
learning. The stimulus-value of attitude sug- 
gests why investigators have discovered so many 
different phenomena—aside from overt be- 
havior—associated with attitude. , 

The definition of attitude refers to the dis- 
tinction made by Miller and Dollard (14, pp- 
22-23) between the cue- and drive-producing 
characteristics of a stimulus. The numbers used 
in counting, they point out, have cue value: 
the number “six” is a response to the cue of 
"five" and in turn acts as a cue to elicit "seven." 
An attitude has cue-value in the sense that it 
acts as a stimulus to produce another response, 
but it also is a drive in the sense that its tension 
is reduced through subsequent behavior leading 
to a reward. It is proper, consequently, to refer 
to the drive strength of an attitude as well as 
to the strength of the bond between the attitude 
and the responses it evokes. An individual, for 
example, who has not been thinking of food is 
shown a picture of a type of food he likes. The 
picture may be said to evoke his favorable atti- 
tude toward the food which is a drive whose 
goal response is eating. If the secondarily evoked 
response pattern is only salivation and the 
thought or statement of how he likes the food, 
the drive may be considered relatively weak. 
But if he drops what he is doing and goes out 
to eat immediately after seeing the picture, the 
drive is relatively strong. 

The strength of an attitude, therefore, may 
refer to its afferent-habit strength, its efferent- 
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habit strength, or its drive strength. Knowledge 
of an attitude’s direction or content is not 
equivalent to measuring its strength. The atti- 
tude possessed by two people, for example, may 
be more or less identical in formal content as 
measured by an attitude scale, but its role in 
determining overt behavior may be quite di- 
verse. In one individual it may be evoked by a 
restricted stimulus pattern (specific attitude), it 
may not be frequently evoked (a weak afferent- 
habit), or the entire pattern of additional im- 
plicit and overt responses may be restricted (a 
weak drive). Any one or more of these condi- 
tions may be different in the second individual. 
Thus the role of the patriotic attitude in the 
average citizen is different from the way it func- 
tions within a professional patriot. Or within 
the same individual the patriotic attitude may 
be more easily and variously aroused and lead 
more certainly to a greater number of responses 
during a war than when there is peace. 

The fate of an attitude over time—whether 
it persists or changes—is obviously a function 
^ of complicated processes within the individual. 
At least three factors are involved. The first is 
the reward or punishment associated with the 
goal response. An attitude will persist when it 
is constantly reinforced or it will change when 
it is partially or wholly extinguished. Its affer- 
ent- and efferent-habit strength is then increased 
or decreased. The efferent-habit strength is a 
direct function of the frequency with which 
the attitude has mediated rewarding or non- 
rewarding behavior as well as of the immediacy 
and amount of the reward or non-reward. Af- 
ferenthabit strength depends ultimately upon 
efferent-habit strength since reinforcement or 
non-reinforcement results not from the arousal 
of the attitude but from the behavior mediated 
by the attitude; it also depends on the frequency 
with which particular stimulus patterns have 
aroused the mediating attitude and the distinc- 
tiveness of those patterns. In Hull's words (9), 
attitudes have the property of "pure stimulus 
acts" within a single individual: they are re- 
sponses which are reinforced "because they, as 
stimuli, evoke other acts . . . which will bring 
about a reinforcing state of affairs. . . ." 

Secondly, there is the factor of conflict with 
competing drives which may determine the fate 
of an attitude. The afferent- and efferent-habit 
strength of an attitude may be great, but its 
drive strength may be weak in comparison with 
other attitudes or drives aroused by the same 
or different stimulus patterns. The individual, 


for example, may not express his attitude in 
overt behavior because its expression would be 
contrary to his general philosophy; but his atti- 
tude persists. Finally, there is forgetting which 
may involve other psychological processes be- 
sides extinction through non-reinforcement. 

It is possible, consequently, to have an atti- 
tude play a less significant role in a personality 
by diminishing the number of stimuli which 
evoke it, by having the attitude aroused less 
frequently, by weakening its drive strength, by 
setting up stronger competing drives, or by 
punishing the overt behavior which it mediates. 
Since the afferent-habit strength is affected by 
the final goal response, moreover, it follows that 
the attitude's strength in this respect can remain 
more or less constant even though its efferent- 
habit strength vis-à-vis a particular response 
pattern is weakened: it is only necessary that 
some reinforcement be obtained by having the 
attitude mediate other goal responses. The prej- 
udiced individual, for example, can retain his 
prejudice not by always behaving identically in 
response to a given stimulus pattern in his so- 
ciety but by behaving differently and with more 
or less equal satisfaction. 

There is a voluminous literature on the sub- 
ject of attitude change, a good part of which is 
called experimental (cf. 15, pp. 946-980). In 
general, the approach has been to subject a 
group of individuals to a collection of stimuli 
(like a course in race relations, a motion pic- 
ture, propaganda analysis, or a visit to an ethnic 
group in a large American city). Their changes 
in attitude (if any) are ascertained by compar- 
ing their scores on an attitude scale before and 
after being subjected to the stimuli. Sometimes 
the changes are compared with those of a con- 
trol group which has not been so stimulated. It 
must be said that this type of approach is so 
cially useful but psychologically sterile: it is im- 
portant to know that lectures or visits do or do 
not affect students but, except for sporadic at 
tempts to correlate attitude change with obvi- 
ous factors like intelligence and “intensity” of 
the attitude at the outset, no effort has been 
made to suggest precisely why the attitudes do 
or do not change. They change, it is said, “be- 
cause" the student heard the lecture or met à 
Negro or saw a motion picture. Obviously the 
word "because" is misleading. All that has been 
demonstrated is a correlation, not a causal con- 
nection. Two people with approximately the 
same attitude and subject to approximately the 
same stimulus pattern, for example, will be af 


fected differently because different responses, 
including or excluding the attitude, are evoked 
within them, Or the overt effect may be the 
same for different reasons: in one, the afferent 
strength of the attitude may be altered only in 
respect to particular stimulus patterns and not 
to others; whereas in another, the drive strength 
of the attitude may be reduced. To maintain, 
therefore, that a course or a contact produces 
attitude change in a group of individuals (with 
a large or small sigma) is to state an historically 
limited conclusion which certainly may be so- 
cially useful at a given moment. It is like indi- 
cating that the abolition of a slum area dimin- 
ishes the delinquency rate; it does, but thereby 
little insight is gained into the nature of delin- 
quency (e.g, why there are delinquents from 
non-slum areas). Research on attitude change, 
consequently, is likely to produce superficial 
generalizations which, being derived so uniquely 
from particular phenomena, can be applied to 
future situations only with extreme difficulty. 
The psychological aspects of this definition of 
attitude, finally, can shed some light on another 
traditional problem: the relation between atti- 
tude and stereotype. Numerous studies, espe- 
cially those of Katz and Braly (11) and of Child 
and Doob (6), have demonstrated the close con- 
nection between the two. The term “stereo- 
type” is loosely employed. Like attitude, it 
seems to be an implicit response—this is what 
Lippmann (12) many years ago must have 
meant by defining it as “the picture in our 
heads." It becomes an attitude, however, only 
when it is also drive-producing. An individual, 
for example, has a favorable attitude toward 
Great Britain, it is said in oversimplified fash- 
ion, because he considers the British “patriotic.” 
If the oversimplified explanation be accepted 
as valid, some learning process in the past must 
be presumed. A word like “good” must first 
have produced a “favorable” response since it 
was conditioned to a stimulus pattern like par- 
ent or food which for other reasons evoked that 
response. Then "patriotic" became associated 
with “good” for reasons to be found in the in- 
dividual's milieu and also evoked that response. 
"Great Britain" as a stimulus came to produce 
the response of "patriotic" (the stereotype), a 
bit of learning doubtlessly also socially condi- 
tioned. “Patriotic” thus is a ng 
response which, finally, evokes the “favorable 
response to which it has been previously linked. 
In this instance, the stereotype evoked by 
"Great Britain” is a mediating response in a 
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long response sequence and does evoke a drive. 
In like manner, most people in our society 
reply to the question, "Is the earth round or 
flat?" by means of some stereotype which pro- 
duces the word "round." The stereotype ap- 
pears to have cue- but essentially no drive-value 
and hence is not an attitude, although the 
question itself may arouse another implicit 
response with drive-value (attitude): to reply 
incorrectly to a simple question is usually non- 
rewarding. 

5. “An attitude is an implicit response . . . 
which is considered socially significant in the 
individual's society.” If attitudes are defined as 
implicit, drive-producing responses, it is obvious 
that they thereby are placed within the psycho- 
logical sub-class of such responses but that si- 
multaneously they are not distinguished from 
other responses in this group. The other psy- 
chological qualifications which have been dis- 
cussed—their anticipatory and mediating char- 
acter, the variety of stimuli which evoke them, 
and the variety of responses they evoke—are 
also common to other responses which, accord- 
ing to conventional usage, may not be consid- 
ered attitudes. The rat’s aversion to a grid on 
which it has been shocked is doubtless implicit, 
it is anticipatory and mediating in avoiding the 
shock, it can be evoked by a gradient of stimuli, 
and it has drive value resulting in other re- 
sponses which serve perhaps to allay its anxiety; 
but it is ordinarily not useful to call such a re- 
sponse an attitude. Similarly the human being’s 
favorable disposition toward cream cheese or a 
style of hat may or may not be called an atti- 
tude. Psychologically, therefore, attitude is not 
and—it is deliberately maintained—should not 
be distinguished from the larger sub-class of 
responses to which it appears to belong. 

The distinction between attitude and other 
types of responses can be made not on a psycho- 
logical but on a social basis: its socially evalu- 
ated significance must be considered. The rat's 
aversion to the grid is not an attitude unless it 
plays an important part in its behavior in other 
situations involving rats; from the human stand- 
point, it is not at all socially significant. The 
liking of cream cheese or a hat may or may not 
be considered socially significant: from an over- 
all point of view, it is doubtless trivial and 
hence not an attitude, although it undoubtedly 
is of importance to cheese producers and hat 
manufacturers as well as their clients. The 
words "socially significant" must be left unspe- 
cific, since social values fluctuate from society to 
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society and, within a particular society, over 
time and in various groups. Almost always the 
socially significant in a society is evaluated or, 
more simply, is called "good" or "bad" to indi- 
cate its actual or potential effect on people. A 
useful test to decide whether an implicit re- 
sponse with drive-value is an attitude is to dis- 
cover whether that response is labelled desirable 
or undesirable by the individual's contempo- 
raries. In any segment of society at a given mo- 
ment, people are likely to acquire attitudes 
because they are subject to similar learning 
conditions. 


ANALYSIS OF ATTITUDES 


An ideally thorough analysis of an attitude 
requires knowledge (quantitative if possible) of 
the following factors, if its behavior and role in 
determining overt behavior in various situations 
are to be understood: 


l. Goal response: the response pattern or 
patterns which the attitude anticipates and me- 
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7. Behavior Theory and the Behavior of 
Attitudes: Some Critical Comments 


ISIDOR CHEIN 


I. INTRODUCTION 


In his paper on “The Behavior of Attitudes” 
(2) Professor Doob has contributed a lucid and 
thorough analysis of the concept of attitude, 
one which will no doubt have considerable im- 
pact on social psychologists and on the status 
of social psychology. It is desirable to examine 
his argument carefully and to encourage the 
fullest discussion of it because, as will be ex- 
plained shortly, it constitutes an interesting test 
of behavior theory and because, if his conclu- 
sions are accepted, they may have serious im- 
plications for social psychology. 

In brief, Professor Doob examines attitudes 
in the light of behavior theory! (the theory of 
behavior developed by Hull and his associates) 
and comes to the conclusion that it would be 
tempting to propose (if such a proposal had any 
prospects of success) that the concept of attitude 
be dropped from social science, that the con- 
cept serves a quasi-scientific need, and that 
when the concept will finally be abandoned, it 
“will be a happy day for social science, since 
this event will signify the emergence of a more 
integrated and scientific system of human be- 
havior.2 


* Excerpted from an article in Psychological Re- 
view, 1948, 55, 175-188, with permission of the 
author and the American Psychological Associ- 
ation, 


1 We may perhaps register a mild protest at the 
appropriation of “behavior theory” for a particular 
theory of behavior, but we shall retain this usage 
in the present paper. 

2 It should be noted that Doob's main dissatisfac- 
tion with the concept of attitude is not on the 
ground of its vagueness and lack of precise defini- 
tion; This type of criticism of many psychological 
concepts occurs with sufficient frequency, however, 
to justify some comment on it. The present writer, 
at least, does not consider the demonstration that 


Il. THE NATURE OF ATTITUDE 


Doob defines an attitude as “(1) an implicit 
response (2) which is both (a) anticipatory and 
(b) mediating in reference to patterns of overt 
responses, (3) which is evoked (a) by a variety 
of stimulus patterns (b) as a result of previous 
learning or of gradients of generalization and 
discrimination, (4) which is itself cue- and drive- 
producing, (5) and which is considered socially 
significant in the individual's society." 


a term is ambiguously and inconsistently used as 
sufficient reason for discrediting it in scientific dis- 
course. Words that have real referents are neces- 
sarily as ambiguous as their referents are vaguely 
apprehended. To abandon a word because it is 
not precisely defined is to give up the scientific 
quest before it has begun. One begins by vaguely 
observing something (whether it be matter, or 
intelligence, or attitude) and gradually sharpens 
one’s observation until the word designating this 
observed something is more and more precisely 
defined. Science, after all, begins in ignorance. 

Sometimes the process of sharpened observation 
leads to the discovery that the word designates a 
null class, that it has no real referent, and that the 
original vague observation was illusory. This Doob 
does not attempt to show with regard to attitude. 
Sometimes the process of sharpened observation 
leads to the discovery that the word designates a 
class of referents which are included in the refer- 
ents of another word and which are not distin- 
guishable from the latter. Such a discovery leads 
to greater unification of science, and this Doob 
attempts to do with regard to attitude. Such at- 
tempts should, however, be regarded with great 
caution. To be prematurely carried away by them 
is to give up the investigation of a separate and 
possibly real phenomenon. 

It follows from our conception of the evolution 
of scientific definitions that an argument with re- 
gard to a definition is not merely a quibble over 
words; not if the definition is to be regarded (as 
it should be) as the best possible summary, in the 
light of available knowledge, of the essential nature 
of a real referent. 
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The bulk of Doob's paper is devoted to the 
explanation and amplification of this definition 
and of the related points of behavior theory. 
Some highlights of the definition are, in the 
light of Doob's discussion, as follows: An atti- 
tude is a learned response; it may be evoked by 
any one of a variety of stimulus patterns; the 
latter need not have been directly associated 
with the response, but may through a process of 
secondary conditioning have become associated 
with a stimulus which has been directly associ- 
ated with the response, or they may be suffi- 
ciently similar to the latter stimulus to evoke 
the same response; an attitude is itself not 
merely a response, but also a stimulus which 
may evoke any one of a variety of learned re- 
sponses; it is, moreover, the kind of stimulus 
which is tension producing and, consequently, 
some of the learned responses to it (i.e. goal 
responses) have the effect of reducing the ten- 
sion (reward); it cannot itself be observed by 
an outsider, but the responses to which it gives 
rise are observable; it is no different from any 
other learned, implicit, cue- and drive-produc- 
ing response such as a rat's aversion to a grid 
on which it has been shocked; but it may be 
arbitrarily distinguished from other such re- 
sponses on non-scientific grounds (social signif- 
icance). Other points of Doob's definition and 
elaboration will also be commented on in the 
following discussion. 

By way of contrast and to serve as background 
in our discussion of Doob's concept, we present 
an alternative conception of the nature of atti- 
tude, an alternative which we prefer and which 
we believe will be a recognizable version of the 
common usage of the term. 

An attitude, according to this conception, is 
a disposition to evaluate certain objects, actions, 
and situations in certain ways. For purposes of 
the present definition, evaluation requires 
words; the disposition to evaluate need not, how- 
ever, be verbalized. An attitude may be con- 
scious (recognized and known by its possessor") 
or it may be unconscious. It may be momentary 
or persistent. If it persists, it becomes salient in 
situations to which it is pertinent; otherwise it 
may be relatively unobservable, except through 
its effects. It may pertain to matters socially sig- 
nificant or insignificant. The evaluations in- 
volved may correspond to, or differ from, the 
evaluations of most members of the culture or 
subculture or the “coded” cultural and institu- 
tional values. They may or may not have strong 
personal pertinence, self-reference, or ego- 


involvement. They may or may not be asso- 
ciated with strong impelling motives. The 
disposition to evaluate in certain ways is gen- 
erated by such factors as one’s available knowl- 
edge concerning the object, action, or situation, 
one’s comprehension of the interrelatedness and 
interrelations of phenomena and of the conse- 
quences of actions, one's time perspective, one's 
motives and motivational displacements or sub- 
stitutions, one's conscious and unconscious pat- 
terns of phenomenal equivalences (symbolisms 
in the psychoanalytic sense), one's introjected 
moral values, and the interaction of these fac- 
tors in relation to other attitudes. Various as- 
pects of this conception will also be elaborated 
upon in the course of the following discussion. 

From the viewpoint of this second conception 
of the nature of attitude, when Doob writes that 
“The rat's aversion to a grid on which it has 
been shocked is doubtless implicit, it is antici- 
patory and mediating in avoiding the shock, it 
can be evoked by a gradient of stimuli, ond it 
has drive value resulting in other responses 
which serve perhaps to allay its anxiety; but it 
is ordinarily not useful to call such a response 
an attitude," we may add that not only would 
it be not useful to call such a response (or re- 
sponse pattern) an attitude, but that it would 
be wrong. Rats may have aversions, but they do 
not have dispositions to evaluate which, in this 
case, would require a disposition to verbalize 
the aversion. 

It should be noted that, although the two 
conceptions are assuredly different, there are 
many points of agreement or near agreement. 
Both would, for example, agree that (1) à 
person is not born with his attitudes, (2) the 
learning process plays a major role in the de- 
velopment of attitudes, (3) attitudes involve 
problems of perception and motivation, (4) as 
a result of a particular attitude a person may 
be more likely to perceive certain objects than 
others, (5) some attitudes affect perception after 
their arousal even though they may not have 
oriented the person originally in the direction 
of the perceived objects, (6) specific behavior 
cannot be safely predicted from a knowledge 
of attitude alone, and (7) people may act con- 
trary to their attitudes. 


HL POINTS OF INCONSISTENCY IN DOOB'S 
DEFINITION 


The most important inconsistency has to do 
with the term in Doob's definition which asserts 
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that an attitude is a response. Elsewhere, how- 
ever, he writes that, “The individual . . . may 
not express his attitude in overt behayior be- 
cause its expression would be contrary to his 
general philosophy; but his attitude persists” 
(italics added). Now, by any ordinary usage of 
the word “response,” a response occurs and is 
gone; it does not persist. In other words, if an 
attitude can persist, it cannot bc a response. 
Doob may, of course, have in mind a class of 
persistent responses, but such responses do not 
seem to belong in the stimulus-response formula 
and, hence, in behavior theory. In all S-R psy- 
chologies one looks for a stimulus which imme- 
diately precedes the response, and to admit re- 
sponses which keep going (perhaps for many 
years) is certainly to change the psychological 
significance of the stimulus term in the S-R 
formula. 

Despite his definition of an attitude as a re- 
sponse, Doob often seems on the verge of think- 
ing of an attitude as a habit or an established 
stimulus-response "bond." Thus he defines the 
strength of an attitude in terms of the strengths 
of the stimulusresponse bonds in which it is 
involved. This would give meaning to Doob's 
statement that an attitude may persist, for, 
while the response may be momentary, the 
bond presumably persists. It may be more sen- 
sible, therefore, to think of an attitude as the 
habit rather than as the response. 

But an attitude cannot be both a response 
and a habit; it must be one or the other or 
neither. It is our inclination to say neither. For 
in customary usage it seems that there may be 
both persistent and momentary (possibly but not 
necessarily, recurrent) attitudes. And it seems 
to us that both have more in common with 
each other than either does, respectively, with a 
habit or a response. 

Closely related to the question of whether an 
attitude is a response is the question of the re- 
lation of attitude to set. In our own conception, 
attitudes are a species of set. Doob also concedes 
that "Attitude may be included among the sets 
determining both the orientation of the indi- 
vidual as well as the kind of perceptual re- 
sponse he makes," Sets may also be momentary 
or persistent (and hence our previous question 
as to whether an attitude is best conceived of 
in relation to habits or responses or to neither 
applies to set as well), but, even if we concede 
that all sets are responses, the important thing 
about a set is not that it is a response, but the 
selective function that it plays. In other words, 


in playing up the response aspect of attitude, 
Doob is missing the boat. 

That Doob seems to have some kind of an- 
tipathy to set is suggested by the following 
passage: "Writers on attitude are fond of re- 
calling that the tradition of set is somewhat 
hoary in academic psychology, in order to 
demonstrate their own respectability and their 
acquaintance with long German terms like Be- 
wusstseinslage." Perhaps this antipathy accounts 
for Doob's insistence that an attitude is im- 
plicit, ix, not “overt behavior that is observ- 
able to an outsider." His sole justification for 
the criterion of implicitness is his opinion that 
“The semantic connection of such sets [motor 
‘attitudes’ like the physical set of the runner 
before the starter's gun is fired] with the concept 
of attitude employed by social scientists is . . . 
largely fortuitous from an historical stand- 
point." But if attitudes are a species of set, then 
the connection is by no means an historical ac- 
cident; because, apart from the question of 
whether any set can be said to have no motor 
aspects, sets which are manifestly motor and 
other sets have much in common, at least in a 
functional sense. 

The matter is of some importance because, 
in making implicitness a defining attribute of 
attitude, Doob introduces a quite dispensable 
source of ambiguity. There is, of course, no 
sharp line of demarcation between implicit and 
explicit as Doob defines these terms; observ- 
ability varies in degree. This implies that it 
must in many instances be difficult to decide 
(by Doob's definition) whether or not one is 
dealing with an attitude. How unobservable 
must an attitude be before one ceases to refer 
to it as an attitude? Also, it must often be dif- 
ficult to distinguish between an overt manifes- 
tation of an attitude and an overt consequence 
of it. For instance, is a moue of distaste a mani- 
festation or a consequence? If it should be de- 
cided that it is a manifestation, then by Doob's 
definition, whatever it is a manifestation of is 
not an attitude; but if it is regarded as a con- 
sequence, then what it is a consequence of is 
an attitude. Insofar as we may arbitrarily re- 
gard it as either manifestation or consequence, 
then we may arbitrarily decide that we are or 
are not dealing with an attitude. 

As the final item in this section we shall 
raise the question of whether an attitude is 
properly defined as anticipatory. Doob follows 
Hull in defining an anticipatory response as 
"one which originally preceded another re- 
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warded response and which, as a result of be- 
ing associated with or producing this reward, 
has been reinforced so that it occurs before its 
‘original time in the response series.’ ” Doob's 
illustrations are singularly unilluminating with 
regard to attitudes as anticipatory responses. 
Thus, in the case of an individual disliking 
another person, we are told: "Originally the 
avoidance occurred only after actual contact 
had been established and after the contact had 
proven to be punishing and the withdrawal 
to be rewarding." It is clear that the avoidance 
response may occur more quickly as a result 
of learning than it did originally, but it is not 
clear how it can be regarded as an anticipatory 
response since, being the goal response, it is 
always necessarily the last in the series. Doob, 
as a matter of fact, explicitly states that an 
attitude precedes (leads to) the goal response. 
Perhaps, of course, the attitude is the dislike 
rather than the avoidance (the latter being 
regarded, perhaps, as a response to the former) 
and it is the dislike which moves up in the 
response series. The dislike presumably (at 
least by the definition) does not become an 
attitude until it does move up in the response 
series, but it is not at all clear why a dislike 
which arises more quickly should be called an 
attitude while one which arises less quickly 
(and is in fact the same dislike) should not be 
called an attitude. Moreover Doob goes on to 
say that, "When a thorough investigation re- 
veals no actual prior contact, some process of 
generalization or discrimination must have oc- 
curred since all behavior has antecedents." In 
other words, not only does an attitude occur 
before its original time in the response series, 
but it seems that it may occur in a response 
series in which it has never previously occurred 
at all; if mere temporal placement may define 
anticipation, then this is anticipation with a 
vengeance. 

Again the matter is of some importance 
because making an attitude, by definition, 
anticipatory, leads to various difficulties. It pre- 
cludes, for instance, the possibility of func- 
tional or dynamic contemporaneity between 
the attitude and the pertinent goal response. 
Even if an attitude may be said to persist, 
Doob's position implies that whatever effect 
an attitude may have has already taken place 
by the time the goal response occurs. Yet, to 
take one example, a young man may have an 
attitude about kissing girls even while acting 
in accordance with, or contrary to, his attitude; 


and his enjoyment of the process and later 
reactions to it may continue to be influenced 
by his attitude. Doob's formulation raises the 
problem which confronts all attempts to deal 
with dynamics in historical as distinguished 
from ahistorical (cf. 1) terms, namely, how 
something which no longer exists (and it no 
longer exists because it antedates its effects) 
can influence something which is now going 
on. 


IV. POINTS NOT REQUIRED BY BEHAVIOR 
THEORY 


Doob's definition requires that attitudes play 
a mediating role and he explains that “A medi- 
ating response is made in an attempt to in- 
crease the likelihood of the occurrence of 
reward rather than punishment in connection 
with a goal response." This qualification im- 
poses, without any apparent justification, a re- 
striction upon the concept of attitude which is 
not in accord with current usage. And while 
we cannot discern any reason for the restric- 
tion, there are several reasons for objecting 
to it. 

In the first place, attitudes may decrease the 
likelihood of goal responses being rewarding. 
For instance, a student who cheats on an exam- 
ination may find his "achievement" rather 
empty as a consequence of his attitude regard- 
ing honesty. In the second place, by defining 
attitudes as mediating, they are functionally 
placed between the person and his goal re- 
sponses. Yet attitudes play a role in the very 
selection of goal responses. For example, partly 
as a function of his attitudes concerning pet 
sonal integrity, a student may not have the 
alternative of handing in a term paper which 
he himself has not written. It is misleading to 
say that such attitudes are attempts to in- 
crease the likelihood of reward in connection 
with goal responses when, actually, they play 
a role in defining what is rewarding. 

As a second item we shall deal with Doob's 
analysis of attitude strength. Despite our dif ` 
ferences with this analysis, we regard it as an 
extremely important contribution to the psy- 
chology of attitudes, for it makes some impor- 
tant and hitherto neglected distinctions. It 
should be noted, therefore, that our objections 
are not to the distinctions but to the frame- 
work of behavior theory within which they are 
made. 


Doob distinguishes between “afferent-habit 
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strength of the attitude,” its “efferent-habit 
strength,” and its “drive strength.” He defines 
the first as “the strength of the bond between 
the stimulus pattern and the response which 
is here defined as attitude”; the second as the 
strength of the “bond between attitude as a 
stimulus and a response pattern (implicit or 
overt), rather than between a stimulus pattern 
and attitude as a response”; and the third he 
characterizes as follows: "An attitude -has cue- 
value in the sense that it acts as a stimulus to 
produce another response, but it also is a drive 
in the sense that its tension is reduced through 
subsequent behavior leading to a reward." 
While he never quite defines "drive strength," 
it seems clear that this term signifies the degree 
of impulsion to tension reducing behavior. 

Afferent and efferent habit strengths are both 
described in terms of the strength of stimulus- 
response bonds. What Doob means by the 
strength of such a bond is indicated when he 
writes: “. . . it may not be frequently evoked 
(a weak afferent habit). . . ." The translation 
from strength to frequency of evocation is illu- 
minating. Mere frequency of evocation is ob- 
viously irrelevant except in relation to the 
frequency of occurrence of the stimulus pattern. 
But if it is evoked once by a given stimulus 
pattern, why not all the time? To answer this 
question in terms of habit strength is sheer 
tautology. The obviously correct answer is that 
conditions (inner and outer) surrounding the 
bond are different from time to time. It fol- 
lows that the relative frequency of evocation 
is a function of the surrounding conditions 
rather than of "the strength of the bond" and 
that, in this respect at least, all bonds are 
equally strong.3 


3 It may, of course, be argued that Doob's transla- 
tion of bond strength into frequency of evocation 
was merely a careless slip and that it does not do 
justice to behavior theory. If bond strength can 
be defined independently of frequency of evocation, 
then it may perhaps be said that the latter is a 
function of both the bond strength and conditions 
surrounding the bond. Such a definition may ap- 
parently be found in terms of the frequency (and 
quality?) of the reinforcement involved in estab- 
lishing the bond. We say that it may apparently 
be found because, actually, the effect of such a defi- 
nition is to make “bond strength” a verbal synonym 
of “frequency of reinforcement.” One may, perhaps, 
argue that such a definition is really a hypothesis 
that (a) there really is such a variable as bond 
Strength, (b) it is a function of frequency of rein- 
forcement, and (c) it may be measured in terms of 
the latter. Unless, however, some way is presented 
to verify (a) and (b), the hypothesis is empty and 


If "bonds" may be said to differ in strength, 
the strength would have to be measured in 
terms of what it takes to disrupt them rather 
than in terms of frequency of occurrence. But 
if the amount of effort needed to undo a 
"bond" is a measure of its strength, it follows 
that its strength is a function of the knowledge, 
skill, strategy, and/or appropriateness with 
which these efforts are marshalled against it. 
It becomes apparent that its strength does not 
reside solely in the bond. This does not mean 
that such strength is scientifically insignificant, 
any more than mass is insignificant because it 
does not inhere in physical objects. It does 
mean that in dealing with the strength of atti- 
tudes one's perspective has to take in more 
than the learning process and the "bonds" 
themselves. 

With regard to "drive strength," it is at 
present a moot point whether the "drives" asso- 
ciated with some attitudes come from the atti- 
tudes themselves or whether the motivational 
force comes from factors beyond the attitudes 
themselves. The present writer is inclined to 
favor the second of these possibilities. One 
may find attitude and drive varying indepen- 
dently. Many of one's attitudes toward food 
may, for instance, remain unchanged as one 
goes through the hunger cycle. One may have 
an attitude with regard to, say, the marital 
difficulties of two relative strangers and what 


(c) simply establishes "bond strength" and "fre- 
quency of reinforcement" as verbal synonyms. 

A test of the hypothesis might be provided by 
showing that the relative frequency of evocation 
varies as a function of frequency of reinforcement 
despite the constancy of surrounding conditions. 
Unfortunately, we have no way of holding the in- 
ner conditions, at least, constant. Let us, however, 
grant for purposes of discussion the successful 
performance of such an experiment, with positive 
results. Such results would require a peculiar as- 
sumption to explain them. Since all surrounding 
conditions are being held constant, variation in 
frequency of evocation could only be a function 
of variation in the bond (because for a given status 
of the bond it either will or will not produce the 
response under given stimulation) And this vari- 
ation in the bond cannot be a function of some 
variable factor acting on the bond, since it is 
agreed that all other factors are constant. In other 
words, we would have to assume that the bond 
has acquired an inner life as a result of which it 
waxes and wanes independently of what is going 
on around it. We might then argue whether such 
a waxing and waning is best described in terms 
of an invariant adjective, a-given-degree-of-strength. 
But, for our own part, we would rather not make 
this assumption until we absolutely have to. 
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seems to be the same attitude with regard to 
identical difficulties between one’s two closest 
friends. Does the lack of impulsion to do any- 
thing about it in the first case as compared to 
the strong impulsion to do something in the 
second case signify that these are really two 
different attitudes of different drive strengths? 
Or does the drive in the second case derive 
from one's concern about one's friends, the 
attitude to the marital difficulties being in both 
cases the same? Or, to take another instance, 
the attitude of regarding one's own ethnic 
group as superior to other ethnic groups, does 
this attitude derive its motivational force (e.g., 
to resist any overt contradiction of the attitude) 
from within itself, or does it derive its force 
from such motives as the need to maintain 
one's level of self-esteem? 

We have already stressed the point that our 
objections to Doob's distinctions between the 
three types of "attitude strength" are not to 
the distinctions themselves, but to the concep- 
tual framework within which they are made. 
The distinctions have two definite values. In 
the first place, they throw light on what may 
happen when ratings of the strengths of atti- 
tudes are made. Roughly corresponding to 
Doob's “afferent-habit strength," these ratings 
may reflect the degree of confidence that the 
attitude actually exists (e.g., ratings of low in- 
tensity may correspond to "Sometimes I feel 
this way and sometimes I don't—I'm not sure 
how I feel!" Or, "I am ambivalent about it 
—1 feel both ways—I'm not sure how I feel"); 
or, roughly corresponding to Doob's “efferent 
habit strength," these ratings may reflect one's 
degree of conviction that something ought to 
be done and one's readiness to support such 
action; or, finally, roughly corresponding to 
Doob's "drive strength," they may reflect the 
intensity of one's urge to do something one- 
self. The implication is clear that, in obtain- 
ing ratings of the intensity of attitudes, one 
should if possible specify the dimension of 
intensity involved. 

In the second place, the issue of attitude 
strength aside, Doob's analysis is valuable in 
that it calls attention to the fact that an atti- 
tude is not completely described until we know 
its perceptual-cognitive content (the object, 
action, or situation to which it is pertinent), 
its action implications (one's beliefs about 
what should be done about, or with regard to, 
the object) and its motivational force (the 
strength of the motives with which it is bound 
up) But one cannot separate the cognitive 


from the action aspects as Doob does when he 
treats "afferent-habit strength" as involving a 
different "bond" from "efferent-habit strength." 
If two people have attitudes toward a common 
object, but differ on what they believe should 
be done with regard to that object, it seems 
both simpler and more accurate to say that 
they have different attitudes toward that object 
than to say that they have a common afferent 
habit and different efferent habits. Nor can 
one give unqualified assent to Doob's state- 
ment that "Afferent-habit strength depends 
ultimately on efferent-habit strength since rein- 
forcement or nonreinforcement results not 
from the arousal of the attitude but from the 
behavior mediated by the attitude.” For if one 
could disrupt the “afferent habit,” the “efferent 
habit” would automatically go with it, there 
being nothing to arouse it. In other words, it 
is just as true to say that the efferent-habit 
depends on the afferent-habit. The two are, in 
fact, inseparable. 

In passing, we may note Doob's remark that 
a stereotype "becomes an attitude . . . only 
when it is drive-producing." It seems to us 
that a better formulation would be that a 
stereotype is not an attitude because, taken by 
itself, it has no action implications (ie. it 
involves no opinion or belief concerning what 
should be done about the object of the stereo- 
type). It is part of an attitude and may be 
used to gauge attitudes only insofar as one 
may assume action implications. In Doob's 
language, it is an afferent habit which does 
not necessarily have any efferent habits associ- 
ated with it. Insofar as it creates a likelihood 
of accepting certain action implications, it is 
a condition of an attitude. Concretely one may 
think of certain people as being "dirty," but 
if one is indifferent toward people's dirtiness 
this is not an attitude. Associated drives, on 
the other hand, are not at all necessary to 
have an attitude. One may, for instance, have 
an attitude toward the caste system in India 


without being at all impelled to do something 
about it. 


V. POINTS REQUIRED BY BEHAVIOR 
THEORY 


Doob takes Sherif and Cantril (3) to task for 
saying that it is possible to discuss attitudes with- 
out committing oneself to any particular learn- 
ing theory. In a sense, a proponent of behavior 
theory has to take such a position because be- 
havior theory is built around a particular learn- 
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ing theory. If it is to offer an interpretation 
of any psychological phenomenon, it must do 
so in terms of its particular theory of the learn- 
ing process. 

Concerning the genesis of attitudes, we have 
already indicated in presenting our alternative 
to Doob's conception of the nature of an atti- 
tude our agreement with the proposition that 
the learning process plays a major róle. This 
is, by no means, complete agreement, for Doob 
admits only the learning process (and only a 
particular kind of learning process, at that). 
To hold the latter position, it seem to us, is 
to take insufficient account of creative cogni- 
tive processes, both of thought and perception. 
Such processes, we believe, also play a role in 
the generation of attitudes. It consequently 
seems to us that a more satisfactory formulation 
of the relation of learning to attitude would 
be: The residues of past learning play a róle 
in determining how a given situation or object 
is perceived and what a person wants in the 
situation or with regard to the object. The lat- 
ter, in turn, play a róle in determining the 
attitudes which are generated and the salience 
of already existing attitudes. The important 
dynamic fact is how the person perceives the 
situation and what he wants in it rather than 
the fact that learning had previously taken 
place. 

In this connection, we believe that there is 
an intellectual trap in the commonly accepted 
principle that if some psychological process is 
not learned, then it must be innate and, if it 
is not innate, then it must be learned. We 
believe that new insights do arise, and we see 
no good reason why novel features of current 
situations should not be perceived or why pre- 
viously unnoticed features of repeatedly experi- 
enced situations may not be perceived for the 
first time. We also see no reason why such 
cognitive novelties should not play a signifi- 
cant róle with relation to attitude despite an 
absence of associated learning. The atom bomb 
was something new to most people; they had 
no established responses to it; but it exercised 
immediate and profound effects on the atti- 
tudes of many of them. 
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In line with behavior theory, Doob takes 
the position that “Some stimuli come to arouse 
an attitude after a relatively simple process of 
conditioning has occurred. . . . Other stimuli 
evoke or fail to evoke an attitude not because 
of their presence or absence in previous situ- 
ations but because they fall along a stimulus 
gradient of generalization or discrimination." 
The recognition of such gradients is of course 
a great improvement over the oversimplifica- 
tion of early conditioning theories, but from 
our point of view, this is still insufficient. 

The proponents of the more sophisticated 
conditioning theories still act as if new per- 
ceptions never occur or, if they do occur, as 
if they have no dynamic significance since, 
being new, they are not directly or indirectly 
associated with past learning. Suppose, for 
example, that a new perception occurs which 
just doesn't happen to fall along some gener- 
alization or discrimination gradient. Will a 
person remain completely unaffected by it? We 
think not. Nor can one argue that it is impos- 
sible for a perception not to fall along such a 
gradient without also assuming that these gra- 
dients are completely preestablished. Other- 
wise how can a new gradient ever come into 
being? 

This point is particularly emphasized when 
these gradients are themselves interpreted as 
resulting solely from secondary conditioning. 
The Gestalt psychologists have presented 
enough evidence on structural processes in 
perceptual grouping to raise doubts as to the 
sufficiency of secondary conditioning in this 
context. It is necessary, therefore, to appeal to 
some supplementary principle as Doob does 
when he turns to similarity as a factor involved 
in producing generalization gradients. But if 
similarity is interpreted in terms of a doctrine 
of identical elements or identical neural path- 
ways, the viewpoint is again committed to the 
assumption that there can be no novel per- 
ceptual processes, an assumption which we are 
not prepared to accept. If, on the other hand, 
similarity is interpreted in other terms, then 
something more is involved in the generaliza- 
tion gradients besides the learning process itself. 
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8. The Personal Setting of Public Opinions: 
A Study of Attitudes Toward Russia 


M. BREWSTER SMITH 


Underlying the rapid growth of public opinion 
research in the last decade has been the fruit- 
ful conception of public opinion as a sum or 
resultant of the opinions of individual mem- 
bers of the public. Such a conception leads 
immediately to the question, “What are Smith, 
Jones, Brown, and all the other members of 
the public actually thinking on a given sub- 
ject?"—a question to which the techniques of 
the sampling survey are providing increasingly 
adequate answers. When our interest extends 
to the formative and directing influences on 
public opinion, the same approach leads us to 
ask, "Why do Smith, Jones, etc. think as they 
do?" "Depth interviewing" and its modifica- 
tions have been developed to press this attack. 
If we are to take seriously the productive as- 
sumption that public opinion is most effec- 
tively studied on the individual level, an 
important source of insights into the dynam- 
ics of opinion should be found in the study 
of opinions as they reflect the personalities of 
the individuals who hold them. The relatively 
few studies that have made forays into this area 
have not attempted a systematic formulation 
of the ways in which opinions are grounded 
in the personality. 

To develop and test such a coherent psycho- 
logical approach to public opinion, the inten- 
sive methods of clinical personality study and 
the broader but shallower methods of the poll- 
ing survey were brought to a focus in a coop- 
erative study of attitudes toward Russia.! The 


* Reprinted from Public Opinion Quarterly, 1947, 
11, 507-525, with permission of the author and the 
publisher, 

1 This study was made possible by support from 
the Harvard Laboratory of Social Relations. The 
writer is particularly indebted to Dr. J. S. Bruner 
for his participation and guidance in the planning 
and analysis of the survey phase of the project. 


study as a whole will be reported elsewhere. 
Here some of the main findings of a part of 
the project—a panel survey of attitudes toward 
Russia among a cross-section of adult men in 
a New England community—will be described 
with two ends in view: first, to illustrate the 
main lines of an attempt to formulate system- 
atically the nature of opinion and its relation 
to personality factors, and second, to throw 
some incidental light on the important prob- 
lem of American attitudes toward Russia. 

The initial formulations underlying the 
opinion surveys emerged from the related clini- 
cal study of a small number of individuals, 
whose personalities and opinions were investi- 
gated in the course of over thirty hours of 
individual interviewing and testing. Our ob- 


* jective in the opinion surveys was to refine 


and test the generality of formulations that 
had proved important in understanding the 
opinions of the individual cases studied by 
intensive methods. 

A “panel” representing adult men in the 
community was interviewed personally at the 
men's homes, first in March and then in May, 
1947.2 The first series of interviews, which 
systematically explored the men's attitudes 
toward Russia, comprised 319 men. Of these, 
250 were re-interviewed in order to obtain 
information about their personalities. The 
smaller group taking part in both interviews 
did not differ markedly in demographic char- 
acteristics from the initial group. 

Two general approaches were employed in 
the design of the interviews and in their 


2 We are indebted to Dr. Alvin Zander and Dr. 
Seth Arsenian for their kind cooperation in facili- 
tating the field work. Mrs. Betty Lopez, Mr. David 
Schneider, Mr. Henry Riecken, and the author 
trained the interviewers, and Mrs. Lopez super- 
vized the interviewing. 
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analysis. The first of these sought to provide 
a systematic descriptive analysis of attitudes 
toward Russia. Here the attempt was made to 
delineate an "anatomy" of attitudes that would 
provide an adequate basis for the “physiology” 
of a subsequent functional analysis. The latter 
was concerned with the interplay of opinions 
and other manifestations of the personality. 


PRINCIPAL CONCEPTS OF A DESCRIPTIVE 
ANALYSIS 


Descriptively we may fruitfully distinguish 
how a respondent feels about Russia—the affec- 
tive aspect of his attitudes, and what he thinks 
about it—their cognitive aspect. On the affec- 
tive side, there are the familiar factors of 
direction, defined in terms of his approval or 
disapproval of Russia as a whole or of such 
features as he may differentiate, and intensity, 
his degree of concern. On the cognitive side, 
the informational context of his attitudes may 
be distinguished as the structure of his beliefs 
and knowledge that affect his opinions, and 
their time perspective as his expectations con- 
cerning future developments in regard to 
Russia. 

Both the cognitive and affective elements of 
a person's attitudes patently have much to do 
with what he wants to have done about Russia. 
The latter aspect we singled out as the indi- 
vidual's policy orientation: the measures to- 
ward Russia that he supports and opposes. 
Policy orientation has the most direct political 
relevance of the various descriptive categories, 
but is probably bound more closely than the 
others to the issues of the moment. All of these 
categories seem necessary for an adequate de- 
scription of the attitudes comprising public 
opinion. A brief consideration of some of our 
findings may serve to indicate the utility of 
this scheme, as well as to introduce some neces- 
sary elaborations. 


Direction, Intensity, and Organization 


Probably the first problem to be faced in a 
description of public opinion toward Russia 
is its direction: are people favorable or un- 
favorable? As one might expect, the direction 
of opinion was found to vary according to the 
aspect of the problem in question. Direction 
was therefore studied in terms of the lines along 
which attitudes toward Russia were commonly 
differentiated. In most respects, unfavorable 
opinions prevailed. The extent of disapproval 


showed marked variation, however, and in re- 
gard to Russia's part in World War II, the 
proportion of respondents who indicated un- 
qualified approval considerably exceeded the 
disapproving group. 


Per Gent 
Indicating 
Aspect of Russia Approval 
Russia's part in World War II3 46% 
Russia's international role* 14 
Russian system of government5 6 


One need not rest, however, with the state- 
ment that attitudes are predominantly favor- 
able in some respects and unfavorable in 
others. The different aspects of Russia involved 
in the men's attitudes had different weight in 
determining their over-all evaluation of Russia 
and their policy orientation. Table 1 shows, 
for example, that when the respondents were 
asked to pick out what was most important 
(among 12 things listed) in making them feel 
about Russia the way they did, only 9 per cent 
cited "the part Russia played in World War 
Il,” as compared with 23 per cent who selected 
"Russia's part in spreading world Commu- 
nism" and 21 per cent who mentioned "the 
lack of freedom and democracy inside Russia." 

Attitudes toward Russia can therefore be 
conceived as hierarchically organized. The as- 
pects in regard to which Russia was likely to be 
viewed with relative favor turn out only rarely 
to have had determinative importance for the 
respondents. This finding suggests the danger 
‘of looking to poll results from single questions 
for an assessment of the overall direction of 
opinion. It is vital to know whether direction 
of opinion in regard to a given aspect is conse- 
quential or trivial in the organization of the 
respondent's attitudes. The conception of the 


3 The question was: "Looking back on it now, 
what would you say about Russia's part in the 
war?" Data are for respondents indicating unquali- 
fied approval. 

4 Based on the card question: "Now, about Rus- 
sia's part in the world today—do you think it has 
been mostly good, more good than bad, about half 
good and half bad, more bad than good, or mostly 
bad?" Data are for respondents giving the first two 
answers. 

5 Based on the question: "How do you feel about 
the Russian system of government? What is good and 
what is bad about it?" Data are for respondents 
giving favorable answers, with or without qualifi- 
cation. All three questions were asked in the 
March, 1947, survey. 
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TABLE 1. Relative Importance Attributed by Respondents 


to Various Aspects of Russia* 


Aspects of Russian Foreign Relations 52% 
Russia’s part in spreading world Communism 23% 
The possibility of war with Russia 1 
The part Russia played in World War II 9 
Russia's treatment of small countries 9 
Aspects of Russian Internal Affairs 46 
The lack of freedom and democracy inside Russia 21 
The lack of free enterprise inside Russia 12 
Russia’s treatment of the church and religion 7 
Russia’s concern for the welfare of her people 5 
The equality given all races and minority groups in Russia 1 
Russian planning and efficiency ee 
Russian backwardness and inefficiency du 
Russia's point of view on morality and the family 0 
No opinion 2 
Total 10097, 
Number of respondents 250 


* 'The question asked in May, 1947, was: "As you know, we're interested in 
the way people look at Russia, what they see in her. I wonder if you could 
tell me which of these things you think is most important in making you 
feel about Russia the way you do?" [Show card.] 


** Less than 0.5 per cent. 


hierarchical organization of a person's attitudes 
makes it possible to speak consistently of his 
being favorable to Russia in some respects and 
unfavorable in others, at the same time that 
we ascribe to him a generally favorable—or 
unfavorable—attitude toward Russia. 

It is not enough to know the direction of 
attitudes toward Russia; we must also know 
their intensity. It seems likely, in fact, that the 
just-considered hierarchical organization of a 
person's over-all attitude depends directly on 
the intensity of his component attitudes. Thus, 
those features of Russia about which he feels 
most strongly determine his over-all evaluation 
of Russia and have most to do with his policy 
orientation toward the country. Support for 
this plausible view is found in the survey data. 
When asked which things were most important 
and which next most important in making 
them feel as they did about Russia, some re- 
spondents indicated two aspects both of which 
could be classed under “Russian foreign rela- 
tions” (see Table 1 for the classification), while 
others named aspects both in the area of “Rus- 
sian internal affairs" When these extreme 
groups are compared in respect to the inten- 


sity of their attitudes toward Russia's inter- 
national role, 75 per cent of the former group 
as against 56 per cent of the latter are found 
to have said that they were at least "quite a 
lot" concerned.® In this case, then, intensity 
of opinion was higher with respect to the dom- 
inant area in the organization of the respon- 
dents’ attitudes. If we want to find the keystones 
of opinion on a topic, according to this view, 
we should look for the areas in which the 
intensity of opinion is highest. 


Time Perspective 


A study limited to the affective side of opin- 
ion—its direction and intensity—would give 
at best an incomplete understanding of opinion 
phenomena. The person's beliefs and expecta 
tions form the premises for his evaluation and 
policy orientation. In this regard, the complex 
pattern of expectations that constitutes the 


ê The question, asked in the May survey, was 
"How concerned are you about Russia's part in 
the world today—a great deal, quite a lot, not so 
much, or none at all?" There were 71 and 59 re 
spondents in the two groups, respectively. 
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time perspective of a person’s attitudes is of 
major importance. For some, the meaningful 
future extends little beyond the practical con- 
cerns of tomorrow. The opinions of such per- 
sons with narrow time perspective surely take 
form under different influences and have dif- 
ferent meaning from those of persons whose 
broad time perspective includes a careful bal- 
ancing of future possibilities. Another impor- 
tant distinction is between “short run” and 
“long run" perspectives. In the long run per- 
spective, the constraint of reality may be more 
lax than in the short run, while the implica- 
tions of threat or benefit to the individual are 
less immediate. Discrepancies between the two 
views may therefore be expected. 

Results from our survey seem to require this 
distinction between the short and long run. 
When the respondents were asked, 


“Do you think the United States and Russia 
will get along together better, worse, or about 
the same in the next few years?” 


44 per cent said relations would get "better," 
while only 15 per cent indicated that they 
would get "worse." These optimistic findings 
must be contrasted with the results for the 
following question: 


"Do you expect the United States to fight 
in another war within the next 50 years? [If 
‘yes,’ ‘perhaps,’ or ‘don’t know'] Are there any 
particular countries that you think we might 
fight against?” 


Here 51 per cent of the sample said they defi- 
nitely expected another war, and named Rus- 
sia as the probable enemy, while an additional 
13 per cent thought that if there were to be 
another war it would be with Russia. 

Rather than providing a relatively unbiased 
assessment of expectations in regard to pos- 
sible war with Russia, this question appears to 
have tapped the stereotyped assumption of the 
inevitability of future wars, a belief that en- 
tails little personal threat when held in the rela- 
tively remote time perspective of the long run. 
The high proportion who mentioned Russia as 
the probable enemy did not necessarily foresee 
war with Russia; rather, they expected another 
war eventually and regarded the country with 
which American relations were most problem- 
atic as the most likely candidate. In the short 
run, nearly half of the respondents expected 


improvement in relations between the coun- 
tries at the time of the survey. 


Informational Context 


Time perspective is only a part of the im- 
portant cognitive aspect of attitudes. There 
remains to be considered their informational 
context—the entire complex of beliefs and 
knowledge that bears on a person's opinions. 
At the core of the informational context of our 
respondents' attitudes toward Russia lay the 
picture of Russia that they had formed for 
themselves. The degree of factually correct in- 
formation—or informational level—is only one 
aspect of the informational context, and not 
the most important one. 

Several questions revealed the prevalent 
stereotype of Russia among our respondents as 
one of a dictatorship engaged in spreading 
world Communism for purposes of self-aggran- 
dizement. The economic aspect of Communism 
took a relatively secondary place, while con- 
siderations of social welfare figured scarcely at 
all Beliefs concerning moral and religious 
practices were marginal to the stereotype, but 
those that prevailed were still mostly unfavor- 
able in purport. The picture of Russia formed 
by our respondents was of course intimately 
related to the direction and policy orientation 
of their attitudes. 

In regard to informational level, however, as 
estimated from scores on a series of seven "fact 
questions on Russia, there was practically no 
relation to the direction of their opinions about 
Russia's international role nor to their policy 
orientation. On the other hand, the better in- 
formed men were distinctly more optimistic in 
their time perspective in regard to relations 
with Russia. They were less likely to pick Rus- 
sia as a probable enemy in a future war, and, 
as Table 2 illustrates, they were more likely 
to expect United States-Soviet relations to 
improve. 

Results such as these call into question the 
conclusion of Walsh in 1944 that "the decisive 
factor in American opinion toward Russia ap- 
pears to be neither class, nor religion, nor 
political preference, but information." If we 
re-examine the data as summarized by Walsh, 
we find that it is again time perspective—ex- 
pectations about post-war relations—that is 


7 Cf. W. B. Walsh, “What the American People 
Think of Russia," Public Opinion Quarterly, 8, 
No. 4 (1944). 
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TABLE 2. Expectations about United States-Soviet Relations, 


in Relation to Informational Level 


i 


Question: “Do you think the United 
States and Russia will get 


along together better, worse, 
or about the same in the next 


few years?” 


Relations will get better 
Relations will stay the same 
Relations will get worse 

No opinion 


Total 
Number of respondents 


Well- Poorly 
Informed Informed 
Respondents Respondents 
53% 35% 

28 36 
14 16 
5 13 
100% 100% 
155 164 


a 


related to informational level. It may have been 
that the extent of information about Russia 
was also related to the direction and policy 
orientation of opinion at that time, but the 
poll results that he quotes do not provide the 
necessary data. The present scheme has the 
merit of calling attention to distinguishable 
aspects of attitudes that warrant separate 
consideration. 


Policy Orientation 


The cognitive and affective aspects of atti- 
tudes both underlie their policy orientation, 
which we have defined as the measures, in this 
case toward Russia, that the respondent sup- 
ports and opposes. From the standpoint of the 
individual as well as from that of political 
relevance, policy orientation represents in a 
very real sense the point or focus of his atti- 
tudes. Behind the persistent tendency of most 
people to evaluate the salient features of their 
world is the need to know where one stands, 
to know in at least a rudimentary way the 
course of action one would take if the occasion 
should demand it. In the case of political atti- 
tudes on topics of public concern, policy ori- 
entation consists for the most part in align- 
ment in terms of publically defined issues. 

At the time of our first survey (just before 
the President's “Truman Doctrine" speech), 
the following question provided the principal 


information on our respondents’ policy ori- 
entation: 


“Some people say that the United States should 
try to do everything possible to cooperate with 
Russia and others say that we've got to be 


tough with Russia. Which do you agree with 
most?" 


Fully 53 per cent said that the United States 
should be "tough" with Russia, while only a 
third (33 per cent) favored a policy of "coop- 
eration.” In itself, such a finding is not too 
informative. Further questions revea ed, for 
example, that the majority could not rurther 
specify the policies they would favor, while 
many of those favoring “cooperation” appeared 
to have had relatively stern policies in mind. 
In addition, the supporters of a “tough” policy 
proved to be less easily dislodged from their 
position, and were more likely to indicate that 
policy toward Russia would make a difference 
in their vote for a presidential candidate. Such 
matters of lability and passivity of policy orien- 
tation need investigation to give meaning to 
the bare proportions found in support of dif- 
ferent. policies. 


Illustrative Relationships 


The descriptive features of attitudes that 
have been distinguished cannot of course be 
conceived in isolation. The study of their rela- 
tionships is itself informative, and raises sig- 
nificant problems for functional interpretation. 
Two examples may serve to emphasize the com- 
plexity of the processes involved. 

Intensity of concern about Russia was found 
to be associated with a high level of informa- 
tion about Russia. Among the relatively well- 
informed respondents, 71 per cent said they 
were at least "quite a lot" concerned about 
Russia's part in the world today, in comparison 
to 52 per cent among the poorly-informed 
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group. The respondents who were higher in 
intensity were found to have had more fre- 
quent and adequate contact with the press, 
radio, and magazines. Presumably the relation- 
ship is a reciprocal one: knowledge leads to 
concern, while concern leads to receptivity to 
information. This reciprocal relation between 
intensity and informational level has an im- 
portant consequence: holders of the most nar- 
rowly stereotyped unfavorable views were not 
particularly likely to show the intensity that 
their beliefs might be supposed to warrant. 
The ignorance underlying their unfavorable 
stereotypes was part of a complex interaction 
that also involved their relative lack of concern 
about Russia. 

Another close relationship held between the 
direction of a person’s opinions and the nature 
of the beliefs forming his informational con- 
text. There is good evidence that the interplay 
here was also reciprocal. The influence of be- 
liefs on the direction of opinion needs no spe- 
cial supporting evidence. The reverse influence 
is neatly documented by Table 3. There it may 
be seen that respondents who were relatively 
favorable toward Russia were more likely than 
unfavorable respondents to decide erroneously 
that Russia declared war on Japan shortly be- 
fore the first atomic bomb. In this case it seems 
quite clear that the direction of opinion was 
causally prior to the belief. It is unlikely that 
many of the respondents had heard or read the 
erroneous assertion as a fact, while virtually 
all of the respondents must have been aware 


of the true succession of events when they were 
occurring. In all likelihood, those who an- 
swered the question incorrectly had no belief 
on the matter prior to the asking of the ques- 
tion, and created on the spur of the moment 
a belief consistent with the direction of their 
attitudes. 


FUNCTIONAL ASPECTS OF ATTITUDES 
TOWARD RUSSIA 


One cannot pursue the study of attitudes far, 
however, without inquiring about their func- 
tion in personality. The correspondence, for 
example, between the informational context 
and direction of an attitude can only be under- 
stood in terms of the values activating the indi- 
vidual's over-all view of Russia. That is to say, 
the correspondence between the conception of 
Russia as a dictatorship and unfavorable atti- 
tudes is only "natural" in a public sharing 
democratic values to which dictatorship is 
repugnant. Several ways in which attitudes are 
enmeshed with personality factors emerged 
from the study. They may reflect or express 
the person's central values (their value func- 
tion). They may show consistency with his 
characteristic ways of reacting (their consistency 
function), or perhaps gratify indirectly his basic 
needs (their gratification function). They may 
form part of his attempt to construct for him- 
self a stable and meaningful world within 
which he can order his life (their meaning 
function), Finally, they may serve to express 


TABLE 3. Beliefs Concerning Russian Entry into the War against Japan, 
in Relation to Direction of Opinion toward Russia 
ee 


Belief 


Russia declared war in 1941 
Russia declared war shortly before 


the first atomic bomb was dropped 


Russia declared war shortly after 


the first atomic bomb was dropped 


Russia never declared war 
Don't know 


Total 
Number of respondents 


Favorable Unfavorable 
Respondents* — Respondents* 
4% 4% 
49 35 
38 45 
4 9 
5 7 
100% 100% 
45 109 


* The question, asked in March, 1947, was, "Now about Russia's part in the 
world today—do you think it has been mostly good, more good than bad, 
about half good and half bad, more bad than good, or mostly bad?" Data 
for the first two and the last two categories are compared here. 


64 HISTORICAL FOUNDATIONS 


TABLE 4. Intensity of Attitudes toward Russia, in Relation to 
Breadth of Interests 


——————————————(€á 


Question: "How concerned are you about Respondents Respondents 
Russias part in the world with Broad with Narrow 
today? ..." Interests* Interests* 

. a great deal 40% 28% 

+++ quite a lot 40 28 

. not so much 19 43 
. . none at all 1 4 

No opinion 0 2 

Total 10095 100%, 

Number of respondents 73 110 


ee STRSTR 
* The question, asked in May, 1947, was: “By and large, do you usually 
prefer to be with people who are quite a lot concerned about what is going 
on in the world, or with people who are mostly interested in their homes 


and families?” 


his identification with and promote his accep- 
tance by his favored social groups (their con- 
formity function). 


Personal Values and Attitudes toward Russia 


The nature of a person’s central values was 
found to be important for his attitudes toward 
Russia in several respects. In the first place, the 
scope of his interests is of primary importance 
in determining the intensity of his attitudes 
toward Russia. Secondly, the particular values 
that he holds dear sensitize him to correspond- 
ing aspects of Russia and provide him with 
standards of judgment. Finally—a consequence 
of the second point—the nature of his value 
system has much to do with the hierarchical 
organization of his attitudes toward Russia. 

As might be expected, men with broader 
interests were much more likely than others to 
show a relatively high level of intensity in their 
concern about Russia. We took as one rough 
index of breadth of interests whether a respon- 
dent said that he usually preferred "to be with 
people who are quite a lot concerned about 
what is going on in the world, or with people 
who are mostly interested in their homes and 
familie" Men who gave the former response 
were also more likely than others to place high 
value on participation in community affairs 
and on taking an interest in national and world 
affairs. As Table 4 shows, 80 per cent of the 
men with broader interests said that they were 
at least "quite a lot" concerned about Russia's 
part in the wold. Only 51 per cent of the group 
with narrower interests, on the other hand, 
showed this level of intensity. Intensity of atti- 


tudes toward Russia thus formed part of a more 
general tendency to take an interest in the 
world beyond hearth and home. A respondent 
may have prided himself on a range of inter- 
ests that involved concern about Russia; Or, 
his broader range of interests may have led him 
to enrich his information about Russia to an 
extent that he saw its relevance to values that 
were important to him. 

The part played by the person's central 
values in determining the hierarchical organi- 
zation of his attitudes toward Russia may be 
illustrated by the case of the value of liberty. 
Respondents who gave a response classifiable 
under this value when asked what things in 
life were most important to them? (62 cases) 
were compared with all others (188 cases) in 
regard to the aspect of Russia that they said 
had most to do with their feelings. Of the 
group stressing liberty, 36 per cent selected 
"the lack of freedom and democracy inside 
Russia,” compared with 17 per cent of the 
remaining respondents—and this was the only 
notable difference in the responses of the two 
groups. The degree to which a value is im- 
portant to the individual can thus determine 
which aspect of Russia plays the key part in 
the organization of his attitudes. 


8 The question was: "We've been talking about 
some of your present opinions. We are also inter 
ested in finding out what sorts of things people 
think are important in life, I have a question here 
about what you think is important in life. Its a 
little hard to put in words right off, I know, but 
from your experience, what would you say are the 
most important things to you? What sort of things 
mean the most to you?" 
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The correspondence between a person's cen- 
tral values and the features of Russia around 
which his attitudes are organized is, however, 
by no means direct. Some values of greatest 
importance to him may fail entirely to engage 
with his conceptions of Russia. A good exam- 
ple of this was the value of economic security. 
The most frequent responses elicited by the 
open-ended question on personal values re- 
ferred to economic security—matters like a 
steady job, good pay, etc. A naive application 
of the present approach might therefore lead 
one to expect that individual economic secu- 
rity as espoused by Russia would play an im- 
portant part in the attitudes toward Russia 
formed by the men stressing this value. Actu- 
ally, nothing of the kind was observed. There 
was no association between economic security 
as a value and emphasis on corresponding as- 
pects of Russia. 

Two facts that are probably sufficient to 
account for this finding have major implica- 
tions for a general formulation of the relation 
between personal values and the organization 
of attitudes. In the first place, the information 
in terms of which the respondents might have 
seen the relevance of Russia to economic secu- 
rity was simply not available in the current 
media of communication. While it is probably 
true that the holding of a value sensitizes a 
person to perceive and digest information that 
pertains to it, there is a limit to the extent 
that he will actively seek out information that 
is not readily available. In this case, the re- 
sultant informational context furnished no 
basis for the engagement of the value. Secondly, 
the value of economic security was likely to 
entail a rather narrow scope of interests, in 
the form in which it was important to most 
of our respondents. Men who cited it as a 
central value were likely to show relatively 
little interest in current events or community 
participation, Their interests tended to center 
more exclusively around the daily concerns of 
a minimal existence. There was therefore small 
occasion for them to apply the value of eco- 
nomic security in their judgments of Russia. 

For a value to enter into a person’s attitudes 
on a topic, then, there are at least two neces- 
sary conditions: the scope of the value must be 
broad enough to apply to the topic, and the 
information available to the person must con- 
tain at least some basis for engaging his value. 
Taking these limitations into consideration, 
we may tentatively extend our principle of the 
organization of attitudes. It was previously sug- 


gested that the hierarchical organization of a 
person’s attitudes on a complex topic depends 
directly on their relative intensity. Now we 
may say that intensity is a function of the ex- 
tent to which a personal value is engaged and 
of the importance of this engaged value in the 
hierarchy of the person’s central values. 

Other functional relationships to the person- 
ality doubtless also enter into the determina- 
tion of intensity. Here it may be noted that 
the present statement goes far to clarify the 
reciprocal relationship found to exist between 
informational level and intensity. A more ade- 
quate informational context permits the per- 
son's values to engage more fully with Russia. 


Personality Traits and Attitudes toward Russia 


A. person's attitudes are formed so as to be 
consistent with his characteristic modes of re- 
action, and may be pressed into service for the 
indirect gratification of underlying personality 
strivings. Although there is a serious limit to 
the kinds of personality data obtainable from 
door-to-door interviews, one can, nevertheless, 
get at some important personality factors in 
field interviewing. An illustration of the con- 
sistency of personality traits with attitudes 
toward Russia is a case in point. 

People characteristically respond to a frus- 
trating situation, according to Rosenzweig,? in 
one of three ways: they may turn aggressively 
on others (‘“extrapunitive reaction"), or on 
themselves (‘‘intrapunitive reaction"), or they 
may ignore the frustration (‘‘impunitive reac- 
tion"). It seemed likely that these characteris- 
tic types of reaction might carry over into their 
opinions. A rough indicator of the first two of 
these tendencies was provided by responses to 
the following question: 


"When things go wrong, are you more likely 
to get sore at other people or to feel bad and 
blame yourself for the situation?" 


Table 5 compares the men who said that they 
usually "get sore at" others with those who said 
they blamed themselves. It may be seen that 
those who blame others—the "extrapunitive" 
group—were somewhat more likely than the 
"intrapunitive" respondents to blame Russia 
for United States-Soviet disagreements and to 
support a "tough" United States policy toward 


9 Cf. S. Rosenzweig, “Types of Reaction to Frus- 
tration," Journal of Abnormal and Social Psy- 
chology, 29, No. 3 (1934). 
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TABLE 5. Opinions about Russia, in Relation to Reported 
Reaction to Frustration 


Tends to Tends to 
Opinion Blame Others Blame Self 
ee 
Blames Russia for U.S.-Soviet disagreement* 7495 63% 
Expects U.S.-Soviet relations to stay the same 
or deteriorate 76 49 
Favors “tough” U.S. policy toward Russia 67 57 
Number of respondents 46 127 


* The question, asked in May, 1947, was: “Do you think that the present 
disagreements between the United States and Russia are entirely Russia's 
fault, mostly Russia's fault, the fault of the United States, or do you think 
that both countries are equally to blame?" Data are for respondents giving 


the first two answers. The other questions have already been quoted. 


Russia. This is what one would expect if their 
attitudes were to be consistent with the rest of 
their personality tendencies. They were also 
considerably more likely to expect United 
States-Soviet relations to stay the same or de- 
teriorate. The latter finding appears to have 
been an indirect consequence of the distin- 
guishing personality characteristic. Perhaps a 
greater tendency to blame oneself and the 
groups with which one was closely identified 
required a defensive sanguineness from the 
intrapunitive group. Or, on the other hand, 
those who saw the United States as at least 
partly implicated in the blame may have been 
more likely to think the disagreements between 
the countries remediable. 


Attitudes toward Russia as a Source of 
Meaning and Stability 


On a priori grounds one might suppose that 
à person's attitudes toward any topic serve the 
important function of sorting out his world of 
experience into a predictable order that can 
provide the background for an orderly exis- 
tence. It is necessary to postulate some such 
function in order to interpret the constraint 
that knowledge places on wishful thinking. 
The present data on attitudes toward Russia 
provide several illustrations of the tendency 
of a person's beliefs and expectations to con- 
form to the direction of his attitude. But, not 
surprisingly, the more informed respondents 
were less likely to let their feelings enter into 
their beliefs. For example, of the well-informed 
group (155 cases) 55 per cent said that they 
expected the world to become more commu- 


nistic, compared with 41 per cent of the poorly 
informed group (164 cases. Here was an ex- 
pectation that was surely distasteful to a large 
majority of the respondents but was accepted 
by a majority of the well-informed. 

That knowledge imposes a constraint on 
wishful thinking is an obvious fact, but one 
which should not be neglected. This constraint 
would seem to depend on the likelihood that 
beliefs subject to contradiction in the normal 
course of experience may jeopardize the sta- 
bility of one's world picture. It is possible to 
fend off some of the implications of experience, 
but the need for stability places limits on the 
development of one's private world. 

This formulation leads to an hypothesis 
regarding the balance between differentiation 
and consistency in a person's attitudes. An un- 
differentiated attitude, depicting Russia as all 
black or all white, has the advantage of sim- 
plicity, posing fewer problems of decision than 
one compounded of shades of gray. But in or- 
der that a person's picture of the world may 
seem trustworthy and have the stability desira- 
ble in a map from which he takes his bearings. 
it must also take into account the situation as 
he understands it. The extent that a person’s 
attitudes are differentiated, then, may repre- 
sent the compromise between the need for 
simplicity and the need for adequacy that best 
fulfills his requirement of a stable and mean- 
ingful conception of his world. 


Social Conformity and Attitudes toward Russia 


The functional relationships thus far illus 
trated propose relatively basic relations be 
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TABLE 6* 


Evaluation of Russia’s 
International Role 


Mostly good or more good than bad 
About half good and half bad 

More bad than good or mostly bad 
No opinion 


Total 
Number of respondents 


Estimate 
Own of Friends’ 
Opinions Opinions 
11% 796 
29 21 
59 66 
1 6 
100% 100% 
250 250 


* Data are for the May, 1947, survey. 


tween the structure and content of attitudes 
and personality factors. But the content of a 
person's attitudes may also be taken over more 
or less bodily from his associates or from pres- 
tigeful persons as a way of expressing identifi- 
cation with them and facilitating their accep- 
tance. This pressure toward conformity is 
rooted in the person's basic needs for accep- 
tance and approval. It is not, however, the sole 
or perhaps even the principal source of influ- 
ence promoting relative uniformity of opinion 
within face-to-face groups. In addition there is 
the fact that members of such groups are likely 
to have common informational contexts on a 
topic, both because they share similar sources 
of information and because members of the 
group are themselves major sources of infor- 
mation for one another. Furthermore, they are 
also likely to have acquired from one another 
and from similar life experiences a relatively 
similar value outlook. 

The survey findings were in keeping with 
the supposition that conformity plays an im- 
portant role in the determination of a person's 
attitudes. Our data also suggest the interesting 
possibility that the need for conformity may 
favor shifts in the total distribution of opinion. 
A simple comparison of the men's own opin- 
ions with those that they ascribe to their 
friends reveals a tendency for the friends to 
be considered slightly more anti-Russian 
(Table 6). 

Such a finding may be characteristic of a state 
of affairs in which the "anti" position is more 
strongly and vociferously held than the “pro” 
—as our data indicate was certainly the case 
in regard to Russia at the time of the survey. 
To the extent that the need for approval cre- 
ates real pressure toward conformity, con- 
formity must be toward the opinions of others 


as the person understands them. When one 
direction of opinion has relatively higher “au- 
dibility" than the other, conformity may be 
expected to lead not merely toward uniformity 
but toward uniformity in the direction of the 
more audible opinion. 


CONCLUSION 


While the individual attitudes underlying 
public opinion on any topic are indeed com- 
plex, their complexity need not preclude sys- 
tematic analysis. All too often, investigations 
of public opinion have proceeded to devise 
questions about this or that aspect of a public 
issue on a hitor-miss basis. A framework for 
the description of attitudes is clearly needed. 

The descriptive scheme that has been devel- 
oped here in connection with attitudes toward 
Russia should prove generally useful. Cognitive 
and affective elements as well as policy orien- 
tation can be fruitfully distinguished in most 
political attitudes. A systematic approach of 
this sort throws into perspective important 
relationships among attitudinal features from 
which a more adequate understanding of opin- 
ion processes may be attained. 

Such an adequate understanding, however, 
requires insight into the functions that opin- 
ions serve in the psychological economy of the 
person who holds them—the ways in which 
opinions are embedded in personality. We have 
illustrated in connection with attitudes toward 
Russia a first approximation toward an analy- 
sis of these functions. Further investigations 
embodying a functional approach should add 
to the understanding of the dynamics of opin- 
ion on particular topics and in different pub- 
lics, and be a promising source of advance in 
the theory of public opinion. 
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Attitude Measurement 


A. Standardized Measurement Techniques 


9. Measuring Social Distances 


EMORY S. BOGARDUS 


This study is supplementary to one already 
reported upon under the caption “Social Dis- 
tance and its Origins.”1 Social distance, it may 
be repeated, refers to the degrees and grades 
of understanding and feeling that persons ex- 
perience regarding each other. It explains the 
nature of a great deal of their interaction. It 
charts the character of social relations. 

The measurement of social distances is to 
be viewed simply as a means for securing ade- 
quate interpretations of the varying degrees 
and grades of understanding and feeling that 
exist in social situations. The measurement 
exercise and its results indicate the main points 
for intensive inquiry into human experiences. 

In the experiment described here, a modifi- 
cation of a plan first suggested by Dr. Robert 
E. Park was used. One hundred and ten per- 
sons claiming racial descent as indicated in 
Table 1 took part. These individuals were all 
mature persons of experience, being of two 
groups, either young business men, or public 
school teachers. 

Document 1 is a marked copy of the form 
that was used. The first sentence was read 
slowly to the members of each group and its 
chief points carefully explained. Each person 
was then asked to proceed with the marking, 
putting a cross under each one of the seven 
groupings to which he would admit Armeni- 
ans, for example, and so on down the list of 
races without stopping. 

One of the first questions to be raised is: In 
how many groupings in our country may the 
members of any race, as a class, be admitted, 
(as judged by the ratings of the 110 judges using 


* Reprinted from Journal of Applied Sociology, 
1925, 9, 299-308, with permission of the author and 
the publisher. 


1 Journal of Applied Sociology, January-February, 
1925, pp. 216-226. 


TABLE 1. Racial Descent 


English 60 
Germans $1 
Trish 28 
French 24 
Scotch 22 
Dutch 15 
Scotch-Irish 17 
Welch 6 
Canadian 4 
Chinese 3 
Swedish 3 
Norwegian 3 
Japanese 2 
Jew-German 9 
Spanish 2 
Jew-Russian 1 
Filipino j 
Pole 1 
Indian 1 


the arithmetic mean)? By referring to Docu- 
ment l it will be seen that the Armenians 
would be admitted by the specific person who 
made it out to only one group, namely, the 
visitors group, while the English would be 
admitted in five groups. In the first case the 
index to the social contact range is 1.00; and 
in the second instance, 5.00. The social contacts 
open to the English immigrant are five times 
as various as those open to the Greek. The 
Greeks, it may be noted, would be admitted 
to no groups within the United States, and 
thus the social contact range (S. C. R.) index 
in their case would be .00. 

Reference to the second column of Table 3 
shows that according to the 110 raters the so- 
cial contact range varies from 1.18 for the 
Turks to 4.60 for the English, while the social 
contact range accorded the Italians is 2.26, 
which is intermediate between the extremes. 


7 


DOCUMENT 1 
Social Distance 


According to my first feeling reactions I would willingly admit members of 
each race (as a class, and not the best I have known, nor the worst members) 
to one or more of the classifications under which I have placed a cross (x). 


7 6 5 4 3 2 1 
1 2 3 4 5 6 7 
a - 8 | E 
$ 8B . 3-0 dp is 
dp ad dp Feb a, GP a! 
ge 23 28 $55 B8 $5 oF 
oq Eg HÀ pps PE ig 
Cr GER ka hss hb 22 BE 
Armenians x 
Bulgarians ` x 
Canadians x x x x x 
Chinese x 
Czecho-Slovaks x 
Danes x x 
Dutch T T. 
English x x x x x 
French x x x x x 
French-Canadians x x * x x 
Finns x x 
Germans x x x x ai 
Greeks x 
Hindus x 
Hungarians x 
Indians (Amer.) x x 
Trish x x x x x 
Italians x x 
Japanese x 
Jew-German x 
Jew-Russian x 
Koreans x 
Mexicans x 
Mulattos x 
Negroes x 
Norwegians x x x x x 
Portuguese x 
Filipinos x 
Poles x 
Roumanians x x x x x 
Russians x 
Serbo-Croatians x 
Scotch x x x x x 
Scotch-Irish x x x x x 
Spanish x 
Syrians x 
Swedish x x 
Turks x 
Welch x x x x x 
1. Your father's races ...........- English. e eoo p oes bei Ao ni 


2. Your mother's races ...... «eee Scotch «oo ooo oec e sdas Fa EX 


A significant correlation is at once obvious 
between racial membership of the raters and 
the extent of social contact range is to be noted 
in Table 3.2 Where the racial membership is 
low and the range high, as in the case of the 
Canadians, the relationship of the Canadians 
to the English and other "high" races among 
the raters is the chief explanation. Sometimes, 
as indicated by subsequent interviews with the 
raters, a fellow-feeling was aroused primarily 
by a racial group name, such as French- 
Canadian, 

The S. C. R. index does not indicate merit 
or traits of the respective races, but rather 
something of the extent of the social contacts 
open to each race. The smaller the range of 
contacts accorded a race, the less, presumably, 
the opportunities for accommodation and as- 
similation. The social contact range all indi- 
cates something regarding the racial attitudes 
of the raters. In this way the attitudes of dif- 
ferent groups of raters can be compared. For 
example, the 110 raters participating in this 
experiment may be divided into two groups, 
one of the business men and the other of pub- 
lic school teachers. Table 2 gives a comparison 


TABLE 2. Comparison of Social Contact 
Range Indexes Accorded Different Races by 
Business Men and by Public School Teachers 


(Samples) 
By By Public 
Business School 

Races Men Teachers 
Armenians 1.26 1.97 
Chinese 80 1.61 
English 4.33 4.71 
Germans 3.03 3.67 
Hindus 56 1.58 
Japanese .63 1.71 
Negro .88 1.58 
Turk 60 1.51 


of the S. C. R. indexes accorded certain selected 
races. 

From these figures it will be seen that the 
business men did not accord as wide a range 
of social contacts as the teachers to the respec- 
tive races. The experiences of business men 
may be of a different character; at any rate, the 
need for a study of the experiences of the two 


2 Because of lack of space, the actual correlation 
figures are omitted. 
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groups is the chief observation to be made. 
Each of the social contact range indexes is a 
question mark; each raises important questions 
concerning the experiences of the raters and 
the relationships of the raters to the races that 
are being judged. 

A further examination of the original record 
sheets shows a certain grading of the seven-fold 
classification of groups within which races may 
be permitted social contacts (see Document 1). 
If a race as a class is admitted to the intermar- 
riage group it is usually admitted to the next 
four, involving personal "chum," neighborhood, 
occupational, and citizenship contacts. If not 
admitted to the intermarriage group, a race 
may still be admitted to the personal "chum" 
and other regular contact groupings within the 
country; and so on. Moreover, if debarred from 
citizenship, it is also debarred from all the 
groups on the left-hand; if debarred from the 
“visitors only" classification, then it is forbidden 
entrance to all the left-hand groups; and if ex- 
cluded entirely from the country, then it is ex- 
cluded from all other group contacts. In other 
words, the order of the seven-fold classification 
from left to right seems to constitute (further 
experimentation is needed) a gradation in so- 
cial contact distance. If this be true, then an 
index may be worked out. The "best" index 
would be the lowest or .00, that is, no groups 
removed from the individual rater, and the 
"poorest" would be the highest, 6.00, or six 
groups removed. 

Column 3 in Table 3 gives the arithmetic 
mean of the ratings regarding social contact 
distance of the 110 persons participating. 'The 
Canadians are given the closest "contact" possi- 
bilities with a social contact distance (S. C. D.) 
index of .30, while the Turks are put the far- 
thest away with a S. C. D. index of 4.80. In 
other words, when we compare the S. C. D. 
index with the S. C. R. index a high correlation 
is evident, that is, the best, or highest S. C. R. 
indexes parallel with minor variations the best, 
or lowest, S. C. D. indexes? A long "contact 
range" is paralleled by a short "contact dis- 
tance," and vice versa. That is to say, the Ca- 
nadian immigrant is doubly fortunate, and the 
Turkish immigrant is doubly unfortunate, for 
the Canadian immigrant is not only admitted 
to a large range of group contacts, but he is 
admitted to the most intimate groups; the 


3 Again, for lack of space reasons, correlation fig- 
ures are not given. 


74 ATTITUDE MEASUREMENT 


TABLE 3. Recording Social Distance 
(Arithmetic mean of ratings by 110 judges in each case) 


Racial Social Social Social 
Descent Contact Contact Contact 
of Range Distance Quality 
Raters Index Index Index 
See ee e a ae Tae 
Armenians 0 1.77 3.51 6.16 
Bulgarians 0 1.49 3.97 4.62 
Canadians 4 4.55 .80 22.51 
Chinese 3 1.38 4.28 4.12 
Czecho-Slovak 0 1.84 3.46 6.20 
Danes 0 3.44 1.48 14.82 
Dutch 15 3.65 1.12 16.06 
English 60 4.60 27 22.35 
French 24 4.08 1.04 18.67 
French-Canadian 0 3.60 1.41 16.81 
Finns 0 2.42 2.84 9.48 
Germans 31 3.49 1.89 14.95 
Greeks 0 1.56 3.89 5.23 
Hindus 0 1.30 3.35 3.08 
Hungarians 0 1.80 3.45 5.76 
Indians (Amer.) 1 1.90 3.65 7.30 
Trish 28 4.16 93 19.38 
Italians 0 2.26 2.98 8.87 
Japanese 2 1.41 4.30 4.08 
Jew-German 2 1.78 3.83 5.45 
Jew-Russian 1 1.59 4.13 4.94 
Koreans 0 1.28 4.55 3.54 
Mexicans 0 1.58 4.02 4.57 
Mulattos 0 1.26 4.44 3.62 
Negroes 0 1.37 4.10 3.84 
Norwegians 3 3.39 1.67 14.78 
Portuguese 0 1.94 3.95 6.77 
Filipinos 1 1.61 3.90 5.15 
Poles 1 1.90 3.57 6.65 
Roumanians 0 1.71 3.74 5.47 
Russians 0 1.78 3.57 6.40 
Serbo-Croatians 0 1.42 4.43 3.98 
Scotch 22 4.24 69 20.91 
Scotch-Irish 17 4.12 45 23.05 
Spanish 2 2.81 2.28 14.02 
Syrians 0 1.72 4.00 5.83 
Swedish 3 3.71 1.44 16.20 
Turks 0 1.18 4.80 2.91 
Welsh 6 4.15 1.18 18.72 


Turk, on the other hand, is admitted only to 
a small range of contacts and these are of the 
most remote and least intimate types. 

Again, if the 110 raters be divided into two 
groups of business men and of teachers, the 
former are found to put nearly all the races a 
greater distance away than the latter do. 


Reference to Table 3 will show as in connec 
tion with the S. C. R. indexes that the S. C. D. 
indexes parallel the racial descent numbers of 
the 110 raters. The "English descent" raters 
would not only admit English immigrants to 
the largest range of contacts but also to contacts 
of closest intimacy. Serbo-Croatians, for exam- 


TABLE 4. Highest and Lowest S. C. R. 


Indexes (Samples) 
Highest Lowest 
English 4.60 Turk 1.18 
Canadians 4.55 Mulatto 1.26 
Scotch 4.94 Koreans 1.28 
Irish 4.16 Hindus 1.30 
Welsh 4.15 Negro 1.37 


Scotch-Irish 4.12 Serbo-Croatians 1.42 


TABLE 5. Highest and Lowest S. C. D. 
Indexes (Samples) 


Lowest Highest 
Canadians .90 Turk 4.80 
English 40 Mulatto 4.44 
Scotch 69 Serbo-Croatians 4.43 
Scotch-Irish .75 Japanese 4.30 
Trish 93 


ple, are not represented among the raters at all; 
they are given few contact opportunities and 
these of remote types. The chief significance of 


1 2 
To close To club 
kinship as “chums” 
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TABLE 6. Comparison of Social Contact Dis- 
tance Indexes Accorded Different Races by 
Business Men and by Teachers (Samples) 


By 
Business By 
Races Men Teachers 

Armenians 4.16 3.27 
Chinese 4.83 4.01 
English 63 4 
Germans 1.76 1.53 
Hindus 5.40 2.97 
Japanese 5.16 4.02 
Negro 4.63 3.86 
Turk 5.23 4.57 


the S. C. D. indexes is that they raise questions 
regarding the nature of the personal experi- 
ences of each of the 110 raters whereby they 
admit some races to close proximity and put 
others "far away." 

Since the S. C. R. and S. C. D. indexes are 
largely quantitative, the question arises whether 
or not a qualitative index may be derived. By 
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Figure 1. Social contact indexes (samples) for selected races based on reactions of 
110 raters. Figures over light lines indicate the social distances before contacts are 
permitted to the given races. Figures within black bars denote range of the social 


contacts that are permitted each race. 
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assigning arbitrary values to the worth of the 
social contacts in each of the seven classifica- 
tions (Document 1), namely, a value of 7 for 
grouping 1; 6, for grouping 2; and so on, it is 
possible to work out what may be called a social 
contact quality (S. C. Q.) index. By referring to 
Document 1 it will be seen that the S. C. Q. 
index for both the Armenians and Bulgarians 
is 2, but for the English it is 25, the sum of the 
“quality” units in each of the first five groups. 
The S. C. Q. indexes, therefore, might run from 
25 (for any race might be put in each of the 
first five groupings) down to 1 (the contact 
quality value given to grouping 7). 


TABLE 7. Highest and Lowest Social Quality 
Indexes (Samples) 


Highest Lowest 
Canadians 22.51 Turks 2.91 
Scotch-Irish — 23.05 Hindus 3.08 
English 22.35 Mulattoes 3.62 
Scotch 20.91 Koreans 3.54 
Irish 19.38 Negroes 3.84 


The arithmetic mean of the S. C. Q. indexes 
of the 110 raters for each of the races is given 
in the fourth column of Table 3. The five high- 
est and the five lowest S. C. Q. indexes are 
given in Table 7, with the Canadians at the 
top, and the Turks at the bottom, as in the 
case of S. C. R. indexes. The S. C. Q. indexes 
take into consideration more factors than either 


of the other indexes; they may be considered as 


representing a summary of the factors included 
in both the other indexes. As in the case of the 
other indexes, they call for personal experience 
data from each of the raters as a means of in- 
terpretation. They also require extensive ex- 
perimentation, especially in connection with 
the assignment of the values assigned to each 
of the seven groupings. 


Figure 1 is an attempt to visualize the S. C. R. 


indexes, the S. C. D. indexes, and the 3. C. Q. 
indexes, in one picture. The range of con- 
tacts is indicated by the length of the black 
lines; the social contact distance by the dotted 
lines; and the social contact quality by the 
figures to the right. i 


10. Attitudes Can Be Measured 


L. L. THURSTONE 


l. THE POSSIBILITY OF MEASURING 
ATTITUDE 


The purpose of this paper! is to discuss the 
problem of measuring attitudes and opinions 
and to offer a solution for it. The very fact that 
one offers a solution to a problem so complex 
as that of measuring differences of opinion or 
attitude on disputed social issues makes it evi- 
dent from the start that the solution is more or 
less restricted in nature and that it applies only 
under certain assumptions that will, however, 
be described. In devising a method of measur- 
ing attitude I have tried to get along with the 
fewest possible restrictions because sometimes 
one is tempted to disregard so many factors that 
the original problem disappears. I trust that I 
shall not be accused of throwing out the baby 
with its bath. 

In promising to measure attitudes I shall 
make several common-sense assumptions that 
will be stated here at the outset so that subse- 
quent discussion may not be fogged by confu- 
sion regarding them. If the reader is unwilling 
to grant these assumptions, then I shall have 
nothing to offer him. If they are granted, we 
can proceed with some measuring methods that 
ought to yield interesting results. 


* Reprinted from American Journal of Sociology, 
1928, 33, 529-554, with permission of the Univer- 
sity of Chicago Press. This is one of a series of 
papers by the staff of the Behavior Resear Fund, 
Illinois Institute for Juvenile Research, Chicago. 
Series B, No. 110. 


—— 

1 The original manuscript for this paper has en- 
joyed a great deal of friendly criticism, some of 
which turns on matters of terminology and some 
on the assumptions which are here stated. In order 
to keep this paper within reasonable length, the 
description of the detailed psychophysical methods 
used and the construction of several attitude scales 
are reserved for separate publication. This paper 
concerns then only an outline of one solution to 
the problem of measuring attitude. 


It is necessary to state at the very outset just 
what we shall here mean by the terms “attitude” 
and "opinion." This is all the more necessary 
because the natural first impression about these 
two concepts is that they are not amenable to 
measurement in any real sense. It will be con- 
ceded at the outset that an attitude is a com- 
plex affair which cannot be wholly described 
by any single numerical index. For the problem 
of measurement this statement is analogous to 
the observation that an ordinary table is a com- 
plex affair which cannot be wholly described by 
any single numerical index. So is a man such a 
complexity which cannot be wholly represented 
by a single index. Nevertheless we do not hesi- 
tate to say that we measure the table. The con- 
text usually implies what it is about the table 
that we propose to measure. We say without 
hesitation that we measure a man when we take 
some anthropometric measurements of him. 
'The context may well imply without explicit 
declaration what aspect of the man we are mea- 
suring, his cephalic index, his height or weight 
or what not. Just in the same sense we shall say 
here that we are measuring attitudes. We shall 
state or imply by the context the aspect of 
people's attitudes that we are measuring. The 
point is that it is just as legitimate to say that 
we are measuring attitudes as it is to say that 
we are measuring tables or men. 

The concept "attitude" will be used here to 
denote the sum total of a man's inclinations 
and feelings, prejudice or bias, preconceived 
notions, ideas, fears, threats, and convictions 
about any specified topic. Thus a man's attitude 
about pacifism means here all that he feels and 
thinks about peace and war. It is admittedly a 
subjective and personal affair. 

The concept "opinion" will here mean a 
verbal expression of attitude. If a man says that 
we made a mistake in entering the war against 
Germany, that statement will here be spoken of 
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as an opinion. The term "opinion" will be 
restricted to verbal expression. But it is an 
expression of what? It expresses an attitude, 
supposedly. There should be no difficulty in 
understanding this use of the two terms. The 
verbal expression is the opinion. Our interpre- 
tation of the expressed opinion is that the 
man's attitude is pro-German. An opinion sym- 
bolizes an attitude. 

Our next point concerns what it is that we 
want to measure. When a man says that we 
made a mistake in entering the war with Ger- 
many, the thing that interests us is not really 
the string of words as such or even the imme- 
diate meaning of the sentence merely as it 
stands, but rather the attitude of the speaker, 
the thoughts and feelings of the man about the 
United States, and the war, and Germany. It is 
the attitude that really interests us. The opinion 
has interest only in so far as we interpret it as 
a symbol of attitude. It is therefore something 
about attitudes that we want to measure. We 
shall use opinions as the means for measuring 
attitudes.? 

'There comes to mind the uncertainty of 
using an opinion as an index of attitude. The 
man may be a liar. If he is not intentionally 
misrepresenting his real attitude on a disputed 
question, he may nevertheless modify the ex- 
pression of it for reasons of courtesy, especially 
in those situations in which frank expression of 
attitude may not be well received. This has led 
to the suggestion that a man's action is a safer 
index of his attitude than what he says. But 
his actions may also be distortions of his atti- 
tude. A politican extends friendship and hos- 
pitality in overt action while hiding an attitude 
that he expresses more truthfully to an intimate 
friend. Neither his opinions nor his overt acts 
constitute in any sense an infallible guide to the 
subjective inclinations and preferences that con- 
stitute his attitude. Therefore we must remain 
content to use opinions, or other forms of ac- 
tion, merely as indices of attitude. It must be 


? Professor Faris, who has been kind enough to 
give considerable constructive criticism to the 
manuscript for this paper, has suggested that we 
may be measuring opinion but that we are cer- 
tainly not measuring attitude. It is in part a ter- 
minological question which turns on the concept 
of attitude. If the concept of attitude as here de- 
fined is not acceptable, it may be advisable to 
change the terminology provided that a distinction 
is retained between (1) the objective index, which 
is here called the statement or opinion, and (2) the 
inferred subjective inclination of the person, which 
is here called the attitude variable. 


recognized that there is a discrepancy, some 
error of measurement as it were, between the 
opinion or overt action that we use as an index 
and the attitude that we infer from such an 
index. 

But this discrepancy between the index and 
"truth" is universal. When you want to know 
the temperature of your room, you look at the 
thermometer and use its reading as an index of 
temperature just as though there were no error 
in the index and just as though there were a 
single temperature reading which is the “cor- 
rect" one for the room. If it is desired to ascer- 
tain the volume of a glass paper weight, the 
volume is postulated as an attribute of the piece 
of glass, even though volume is an abstraction. 
The volume is measured indirectly by noting 
the dimensions of the glass or by immersing it 
in water to see how much water it displaces. 
These two procedures give two indices which 
might not agree exactly. In almost every situa- 
tion involving measurement there is postulated 
an abstract continuum such as volume or tem- 
perature, and the allocation of the thing mea- 
sured to that continuum is accomplished usually 
by indirect means through one or more indices. 
Truth is inferred only from the relative con- 
sistency of the several indices, since it is never 
directly known. We are dealing with the same 
type of situation in attempting to measure atti- 
tude. We must postulate an attitude variable 
which is like practically all other measurable 
attributes in the nature of an abstract contin- 
uum, and we must find one or more indices 
which will satisfy us to the extent that they are 
internally consistent. 

In the present study we shall measure the 
subject's attitude as expressed by the acceptance 
or rejection of opinions. But we shall not 
thereby imply that he will necessarily act in ac- 
cordance with the opinions that he has indorsed. 
Let this limitation be clear. The measurement 
of attitudes expressed by a man's opinions does 
not necessarily mean the prediction of what he 
will do. If his expressed opinions and his ac- 
tions are inconsistent, that does not concern us 
now, because we are not setting out to predict 
overt conduct. We shall assume that it is of in- 
terest to know what people say that they believe 
even if their conduct turns out to be inconsis- 
tent with their professed opinions. Even if they 
are intentionally distorting their attitudes, we 
are measuring at least the attitude which they 
are trying to make people believe that they 
have. 


We take for granted that people's attitudes 
are subject to change. When we have measured 
a man’s attitude on any issue such as pacifism, 
we shall not declare such a measurement to be 
in any sense an enduring or constitutional con- 
stant. His attitude may change, of course, from 
one day to the next, and it is our task to mea- 
sure such changes, whether they be due to un- 
known causes or to the presence of some known 
persuasive factor such as the reading of a dis- 
course on the issue in question. However, such 
fluctuations may also be attributed in part to 
error in the measurements themselves. In order 
to isolate the errors of the measurement instru- 
ment from the actual fluctuation in attitude, 
we must calculate the standard error of mea- 
surement of the scale itself, and this can be 
accomplished by methods already well known 
in mental measurement. 

We shall assume that an attitude scale is 
used only in those situations in which one may 
reasonably expect people to tell the truth about 
their convictions or opinions. If a denomina- 
tional school were to submit to its students a 
scale of attitudes about the church, one should 
hardly expect intelligent students to tell the 
truth about their convictions if they deviate 
from orthodox beliefs. At least, the findings 
could be challenged if the situation in which 
attitudes are expressed contains pressure or im- 
plied threat bearing directly on the attitude to 
be measured. Similarly, it would be difficult to 
discover attitudes on sex liberty by a written 
questionnaire, because of the well-nigh univer- 
sal pressure to conceal such attitudes where 
they deviate from supposed conventions. It is 
assumed that attitude scales will be used only 
in those situations that offer a minimum of 
pressure on the attitude to be measured. Such 
situations are common enough. 

All that we can do with an attitude scale is 
to measure the attitude actually expressed with 
the full realization that the subject may be 
consciously hiding his true attitude or that the 
social pressure of the situation has made him 
really believe what he expresses. This is a mat- 
ter for interpretation. It is something probably 
worth while to measure an attitude expressed 
by opinions. It is another problem to interpret 
in each case the extent to which the subjects 
have expressed what they really believe. All 
that we can do is to minimize as far as possible 
the conditions that prevent our subjects from 
telling the truth, or else to adjust our interpre- 
tations accordingly. 
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When we discuss opinions, about prohibition 
for example, we quickly find that these opinions 
are multidimensional, that they cannot all be 
represented in a linear continuum. The various 
opinions cannot be completely described merely 
as “more” or “less.” They scatter in many di- 
mensions, but the very idea of measurement 
implies a linear continuum of some sort such 
as length, price, volume, weight, age. When the 
idea of measurement is applied to scholastic 
achievement, for example, it is necessary to 
force the qualitative variations into a scholastic 
linear scale of some kind. We judge in a similar 
way such qualities as mechanical skill, the ex- 
cellence of handwriting, and the amount of a 
man’s education, as though these traits were 
strung out along a single scale, although they 
are of course in reality scattered in many di- 
mensions. As a matter of fact, we get along 
quite well with the concept of a scale in de- 
scribing traits even so qualitative as education, 
social and economic status, or beauty. A scale 
or linear continuum is implied when we say 
that a man has more education than another, 
or that a woman is more beautiful than another, 
even though, if pressed, we admit that perhaps 
the pair involved in each of the comparisons 
have little if anything in common. It is clear 
that the linear continuum which is implied in 
a “more and less” judgment may be conceptual, 
that it does not necessarily have the physical 
existence of a yardstick. 

And so it is also with attitudes. We do not 
hesitate to compare them by the “more and 
less” type of judgment. We say about a man, 
for example, that he is more in favor of prohi- 
bition than some other, and the judgment con- 
veys its meaning very well with the implication 
of a linear scale along which people or opin- 
ions might be allocated. 


2. THE ATTITUDE VARIABLE 


The first restriction on the problem of mea- 
suring attitudes is to specify an attitude vari- 
able and to limit the measurement to that. An 
example will make this clear, Let us consider 
the prohibition question and let us take as the 
attitude variable the degree of restriction that 
should be imposed on individual liberty in the 
consumption of alcohol. This degree of restric- 
tion can be thought of as a continuum ranging 
from complete and absolute freedom or license 
to equally complete and absolute restriction, 
and it would of course include neutral and in- 
different attitudes. 
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In collecting samples from which to construct 
a scale we might ask a hundred individuals to 
write out their opinions about prohibition. 
Among these we might find one which expresses 
the belief that prohibition has increased the 
use of tobacco. Surely this is an opinion con- 
cerning prohibition, but it would not be at all 
serviceable for measuring the attitude variable 
just mentioned. Hence it would be irrelevant. 
Another man might express the opinion that 
prohibition has eliminated an important source 
of government revenue. This is also an opinion 
concerning prohibition, but it would not be- 
long to the particular attitude variable that we 
have set out to measure or scale. It is preferable 
to use an objective and experimental criterion 
for the elimination of opinions that do not be- 
long on the specified continuum to be measured, 
and I believe that such a criterion is available. 

"This restriction on the problem of measuring 
attitudes is necessary in the very nature of mea- 
surement. It is taken for granted in all ordinary 
measurement, and it must be clear that it ap- 
plies also to measurement in a field in which 
the multidimensional characteristics have not 
yet been so clearly isolated. For example, it 
would be almost ridiculous to call attention to 
the fact that a table cannot be measured unless 
one states or implies what it is about the table 
that is to be measured; its height, its cost, or 
beauty or degree of appropriateness or the 
length of time required to make it. The con- 
text usually makes this restriction on measure- 
ment. When the notion of measurement is 
applied to so complex a phenomenon as opinions 
and attitudes, we must here also restrict our- 
selves to some specified or implied continuum 
along which the measurement is to take place. 

In specifying the attitude variable, the first 
requirement is that it should be so stated that 
one can speak of it in terms of "more" and 


” 


“less,” as, for example, when we compare the 
attitudes of people by saying that one of them 
is more pacifistic, more in favor of prohibition, 
more strongly in favor of capital punishment, 
or more religious than some other person. 

Figure 1 represents an attitude variable, mili- 
tarism-pacifism with a neutral zone. A person 
who usually talks in favor of preparedness, for 
example, would be represented somewhere to 
the right of the neutral zone. A person who is 
more interested in disarmament would be rep- 
resented somewhere to the left of the neutral 
zone. It is possible to conceive of a frequency 
distribution to represent the distribution of at- 
titude in a specified group on the subject of 
pacifism-militarism. 

Consider the ordinate of the frequency dis- 
tribution at any point on the base line. The 
point and its immediate vicinity represent for 
our purpose an attitude, and we want to know 
relatively how common that degree of feeling 
for or against pacifism may be in the group 
that is being studied. It is of secondary interest 
to know that a particular statement of opinion 
is indorsed by a certain proportion of that 
group. It is only to the extent that the opinion 
is representative of an attitude that it is useful 
for our purposes. Later we shall consider the 
possibility that a statement of opinion may be 
scaled as rather pacifistic and yet be indorsed 
by a person of very pronounced militaristic 
sympathies. To the extent that the statement 
is indorsed or rejected by factors other than the 
attitude-variable that it represents, to that €x- 
tent the statement is useless for our purposes. 
We shall also consider an objective criterion 
for spotting such statements so that they may 
be eliminated from the scale. In our enure 
study we shall be dealing, then, with opinions, 
not primarily because of their cognitive content 
but rather because they serve as the carriers or 
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symbols of the attitudes of the people who ex- 
press or indorse these opinions. 

There is some ambiguity in using the term 
attitude in the plural An attitude is repre- 
sented as a point on the attitude continuum. 
Consequently there is an infinite number of 
attitudes that might be represented along the 
attitude scale. In practice, however, we do not 
differentiate so finely. In fact, an attitude, prac- 
tically speaking, is a certain narrow range or 
vicinity on the scale. When a frequency distri- 
bution is drawn for any continuous variable, 
such as stature, we classify the variable for de- 
scriptive purposes into steps or class intervals. 
The attitude variable can also be divided into 
class intervals and the frequency counted in 
each class interval. When we speak of "an" at- 
titude, we shall mean a point, or a vicinity, on 
the attitude continuum. Several attitudes will 
be considered not as a set of discrete entities, 
but as a series of class intervals along the atti- 
tude scale. 


3. A FREQUENCY DISTRIBUTION OF 
ATTITUDES 


The main argument so far has been to show 
that since in ordinary conversation we readily 
and understandably describe individuals as more 
and less pacifistic or more and less militaristic in 
attitude, we may frankly represent this linearity 
in the form of a unidimensional scale. This has 
been done in a diagrammatic way in Figure 1. 
We shall first describe our objective and then 
show how a rational unit of measurement may 
be adopted for the whole scale. 

Let the base line of Figure 1 represent a con- 
tinuous range of attitudes from extreme paci- 
fism on the left to extreme militarism on the 
right. 

If the various steps in such a scale were de- 
fined, it is clear that a person’s attitude on 
militarism-pacifism could be represented by a 
point on that scale. The strength and direction 
of a particular individual's sympathies might 
be indicated by the point a, thus showing that 
he is rather militaristic in his opinions. An- 
other individual might be represented at the 
point b to show that although he is slightly 
militaristic in his opinions, he is not so extreme 
about it as the person who is placed at the 
point a. A third person might be placed at the 
point c to show that he is quite militaristic and 
that the difference between a and c is very 
slight. A similar interpretation might be ex- 
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tended to any point on the continuous scale 
from extreme militarism to extreme pacifism, 
with a neutral or indifference zone between 
them. 

A second characteristic might also be indi- 
cated graphically in terms of the scale, namely, 
the range of opinions that any particular indi- 
vidual is willing to indorse. It is of course not 
to be expected that every person will find only 
one single opinion on the whole scale that he 
is willing to indorse and that he will reject all 
the others. As a matter of fact we should prob- 
ably find ourselves willing to indorse a great 
many opinions on the scale that cover a certain 
range of it. It is conceivable, then, that a paci- 
fistically inclined person would be willing to 
indorse all or most of the opinions in the range 
d to e and that he would reject as too extremely 
pacifistic most of the opinions to the left of d, 
and would also reject the whole range of mili- 
taristic opinions. His attitude would then be 
indicated by the average or mean of the range 
that he indorses, unless he cares to select a par- 
ticular opinion which most nearly represents 
his own attitude. The same sort of reasoning 
may of course be extended to the whole range 
of the scale, so that we should have at least two, 
or possibly three, characteristics of each person 
designated in terms of the scale. These charac- 
teristics would be (1) the mean position that he 
occupies on the scale, (2) the range of opinions 
that he is willing to accept, and (3) that one 
opinion which he selects as the one which most 
nearly represents his own attitude on the issue 
at stake. 

It should also be possible to describe a group 
of individuals by means of the scale. This type 
of description has been represented in a dia- 
grammatic way by the frequency outline. 

Any ordinate of the curve would represent 
the number of individuals, or the percentage 
of the whole group, that indorses the corre- 
sponding opinion. For example, the ordinate at 
b would represent the number of persons in 
the group who indorse the degree of militarism 
represented by the point b on the scale. A 
glance at the frequency curve shows that for 
the fictitious group of this diagram militaristic 
opinions are indorsed more frequently than the 
pacifistic ones. It is clear that the area of this 
frequency diagram would represent the total 
number of indorsements given by the group. 
The diagram can be arranged in several differ- 
ent ways that will be separately discussed. It is 
sufficient at this moment to realize that, given a 
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valid scale of opinions, it would be possible to 
compare several different groups in their atti- 
tudes on a disputed question. 

A second type of group comparison might be 
made by the range or spread that the frequency 
surfaces reveal. If one of the groups is repre- 
sented by a frequency diagram of considerable 
range or scatter, then that group would be 
more heterogeneous on the issue at stake than 
some other group whose frequency diagram of 
attitudes shows a smaller range or scatter. It 
goes without saying that the frequent assump- 
tion of a normal distribution in educational 
scale construction has absolutely no application 
here, because there is no reason whatever to 
assume that any group of people will be nor- 
mally distributed in their opinions about any- 
thing. 

It should be possible, then, to make four types 
of description by means of a scale of attitudes. 
These are (1) the average or mean attitude of 
a particular individual on the issue at stake, (2) 
the range of opinion that he is willing to accept 
or tolerate, (3) the relative popularity of each 
attitude of the scale for a designated group as 
shown by the frequency distribution for that 
group, and (4) the degree of homogeneity or 
heterogeneity in the attitudes of a designated 
group on the issue as shown by the spread or 
dispersion of its frequency distribution. 

This constitutes our objective. The heart of 
the problem is in the unit of measurement for 
the base line, and it is to this aspect of the 
problem that we may now turn. 


4. A UNIT OF MEASUREMENT FOR 
ATTITUDES 


The only way in which we can identify dif- 
ferent attitudes (points on the base line) is to 
use a set of opinions as landmarks, as it were, 
for the different parts or steps of the scale. The 
final scale will then consist of a series of state- 
ments of opinion, each of which is allocated to 
à particular point on the base line. If we start 
with enough statements, we may be able to se- 
lect a list of twenty or thirty opinions so chosen 
that they represent an evenly graduated series 
of attitudes. The separation between successive 
Statements of opinion would then be uniform, 
but the scale can be constructed with a series of 
opinions allocated on the base line even though 
their base line separations are not uniform. For 
the purpose of drawing frequency distributions 


it will be convenient, however, to have the 
statements so chosen that the steps between 
them are uniform throughout the whole range 
of the scale. 

Consider the three statements a, c, and d, in 
Figure 1. The statements c and a are placed close 
together to indicate that they are very similar, 
while statements c and d are spaced far apart 
to indicate that they are very different. We 
should expect two individuals scaled at c and 
a respectively to agree very well in discussing 
pacifism and militarism. On the other hand, we 
should expect to be able to tell the difference 
quite readily between the opinions of a person 
at d and another person at c. The scale separa- 
tions of the opinions must agree with our im- 
pressions of them. 

In order to ascertain how far apart the state- 
ments should be on the final scale, we submit 
them to a group of several hundred people 
who are asked to arrange the statements in 
order from the most pacifistic to the most mili- 
taristic. We do not ask them for their own 
opinions. That is another matter entirely. We 
are now concerned with the construction of 
a scale with a valid unit of measurement. ‘There 
may be a hundred statements in the original 
list, and the several hundred persons are asked 
merely to arrange the statements in rank order 
according to the designated attitude variable. 
It is then possible to ascertain the proportion 
of the readers who consider statement a to be 
more militaristic than statement c. If the two 
statements represent very similar attitudes we 
should not expect to find perfect agreement 
in the rank order of statements a and c. If they 
are identical in attitude, there will be about 
50 per cent of the readers who say that state- 
ment a is more militaristic than statement €, 
while the remaining 50 per cent of the readers 
will say that statement c is more militaristic 
than statement a. It is possible to use the pro- 
portion of readers or judges who agree about 
the rank order of any two statements as a basis 
for actual measurement. 

If 90 per cent of the judges or readers say 
that statement a is more militaristic than state 
ment b a" .90) and if only 60 per cent of 
the readers say that statement a is more mili- 
taristic than statement c (p... — .60) then 
clearly the scale separation (a— c) is shorter 
than the scale separation (a — b). The psycho- 
logical scale separation between any two stim- 
uli can be measured in terms of a law of 


comparative judgment which the writer has re- 
cently formulated 

The detailed methods of handling the data 
will be published in connection with the con- 
struction of each particular scale. The practi- 
cal outcome of this procedure is a series of 
statements of opinions allocated along the base 
line of Figure 1. The interpretation of the 
base-line distances is that the apparent differ- 
ence between any two opinions will be equal 
to the apparent difference between any other 
two opinions which are spaced equally far 
apart on the scale. In other words, the shift 
in opinion represented by a unit distance on 
the base line seems to most people the same 
as the shift in opinion represented by a unit 
distance at any other part of the scale. Two 
individuals who are separated by any given 
distance on the scale seem to differ in their 
attitudes as much as any other two individuals 
with the same scale separation. In this sense 
we have a truly rational base line, and the fre- 
quency diagrams erected on such a base line 
are capable of legitimate interpretation as fre- 
quency surfaces. 

In contrast with such a rational base line or 
scale is the simpler procedure of merely listing 
ten to twenty opinions, arranging them in rank 
order by a few readers, and then merely count- 
ing the number of indorsements for each state- 
ment. That can of course be done providing 
that the resulting diagram be not interpreted 
as a frequency distribution of attitude. If so 
interpreted the diagram can be made to take 
any shape we please by merely adding new 
statements or eliminating some of them, arrang- 
ing the resulting list in a rough rank order 
evenly spaced on the base line. Allport’s dia- 
grams of opinionst are not in any sense 
frequency distributions. They should be consid- 
ered as bar-diagrams in which are shown the 
frequency with which each of a number of 
statements is indorsed. Our principal contri- 
bution here is an improvement on Allport's 
procedure, He is virtually dealing with rank 
orders, which we are here trying to change into 


3 For a more detailed discussion of this law sec 
my article “The Law of Comparative Judgment,” 
Psych, Rev. (July, 1927). For the logic of the psy- 
chological S-scale see “Psychophysical Analysis," 
Amer. J. Psych. (July, 1927). 

4 Floyd H. Allport and D. A. Hartman, “Meas- 
urement and Motivation of Atypical Opinion in a 
Certain Group,” Amer. Pol. Sci. Rev., XIX (1925), 
735-760. 
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measurement by a rational unit of measure- 
ment. Allport's pioneering studies in this field 
should be read by every investigator of this 
problem. My own interest in the possibility of 
measuring attitude by means of opinions was 
started by Allport's paper, and the present 
study is primarily a refinement of his statistical 
methods. 

The unit of measurement for the scale of 
attitudes is the standard deviation of the dis- 
persion projected on the psychophysical scale 
of attitudes by a statement of opinion, chosen 
as a standard. It is a matter of indifference 
which statement is chosen as a standard, since 
the scales produced by different standard state- 
ments will have proportional scale values. This 
mental unit of measurement is roughly com- 
parable to, but not identical with, the so-called 
“just noticeable difference” in psychophysical 
measurement. 

A diagram such as Figure 1 can be con- 
structed in either of at least two different ways. 
The area of the frequency surface may be made 
to represent the total number of votes or in- 
dorsements by a group of people, or the area 
may be made to represent the total number 
of individuals in the group studied. Allport's 
diagrams would be made by the latter prin- 
ciple if they were constructed on a rational 
base line so that a legitimate area might be 
measured. Each subject was asked to select that 
one statement in the list most representative 
of his own attitude. Hence at least the sum of 
the ordinates will equal the total number of 
persons in the group. I have chosen as prefer- 
able the procedure of asking each subject to 
indorse all the statements with which he agrees. 
Since we have a rational base line, we may 
make a legitimate interpretation of the area 
of the surface as the total number of indorse- 
ments made by the group. This procedure has 
the advantage that we may ascertain the range 
of opinion which is acceptable to each person, 
a trait which has considerable interest and 
which cannot be ascertained by asking the sub- 
ject to indorse only one of the statements in 
the list. The ordinates of the frequency dia- 
gram can be plotted as proportions of the 
whole group. They will then be interpreted as 
the probability that the given statement. will 
be indorsed by a member of the group. In 
other words, the frequency diagram is descrip- 
tive of the distribution of attitude in the whole 
group, and at each point on the base line we 
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want an ordinate to represent the relative 
popularity of that attitude. 


5. THE CONSTRUCTION OF AN ATTITUDE 
SCALE 


At the present time three scales for the meas- 
urement of opinion are being constructed by 
the principles here described.5 These three 
scales are planned to measure attitudes on 
three different variables, namely, pacifism- 
militarism, prohibition, and attitude toward 
the church. All three of these scales are being 
constructed first by a procedure somewhat less 
laborious than the direct application of the law 
of comparative judgment, and if consistent 
results are obtained the method will be re- 
tained for other scales. 

The method is as follows. Several groups of 
people are asked to write out their opinions 
on the issue in question, and the literature is 
searched for suitable brief statements that may 
serve the purposes of the scale. By editing such 
material a list of from 100 to 150 statements is 
prepared expressive of attitudes covering as 
far as possible all gradations from one end of 
the scale to the other. It is sometimes necessary 
to give special attention to the neutral state- 
ments. If a random collection of statements of 
opinion should fail to produce neutral state- 
ments, there is some danger that the scale will 
break in two parts. The whole range of atti- 
tudes must be fairly well covered, as far as one 
can tell by preliminary inspection, in order to 
insure that there will be overlapping in the 
rank orders of different readers throughout the 
scale. 

In making the intial list of statements sev- 
eral practical criteria are applied in the first 
editing work. Some of the important criteria 
are as follows: 


1. The statements should be as brief as pos- 
sible so as not to fatigue the subjects who are 
asked to read the whole list. 

2. The statements should be such that they 
can be indorsed or rejected in accordance with 
their agreement or disagreement with the atti- 
tude of the reader. Some statements in a ran- 


5 Three attitude scales are now in course of prep- 
aration by Mr. E. J. Chave, of the Divinity School, 
Univemity of Chicago, on attitudes toward the 
church; by Mrs. Hattie Smith on attitudes about 
prohibition; and by Mr. Daniel Droba on attitudes 
about pacifism-militarism. The latter two will be 
published as Doctor's dissertations. 


dom sample will be so phrased that the reader 
can express no definite indorsement or rejec- 
tion of them. 

3. Every statement should be such that ac- 
ceptance or rejection of the statement does 
indicate something regarding the reader's atti- 
tude about the issue in question. If, for exam- 
ple, the statement is made that war is an 
incentive to inventive genius, the acceptance 
or rejection of it really does not say anything 
regarding the reader's pacifistic or militaristic 
tendencies. He may regard the statement as an 
unquestioned fact and simply indorse it as a 
fact, in which case his answer has not revealed 
anything concerning his own attitude on the 
issue in question. However, only the conspicu- 
ous examples of this effect should be eliminated 
by inspection, because an objective criterion 
is available for detecting such statements so 
that their elimination from the scale will be 
automatic. Personal judgment should be mini- 
mized as far as possible in this type. of work. 

4. Double-barreled statements should be 
avoided except possibly as examples of neu- 
trality when better neutral statements do not 
seem to be readily available. Double-barreled 
statements tend to have a high ambiguity. 

5. One must insure that at least a fair ma- 
jority of the statements really belong on the 
attitude variable that is to be measured. If a 
small number of irrelevant statements should 
be either intentionally or unintentionally left 
in the series, they will be automatically elimi- 
nated by an objective criterion, but the cri- 
terion will not be successful unless the majority 
of the statements are clearly a part of the 
stipulated variable. 


When the original list has been edited with 
these factors in mind, there will be perhaps 
80 to 100 statements to be actually scaled. 
These statements are then mimeographed on 
small cards, one statement on each card. Two 
or three hundred subjects are asked to arrange 
the statements in eleven piles ranging from 
opinions most strongly affirmative to those 
most strongly negative. The detailed instruc 
tions will be published with the description 
of the separate scales. The task is essentially 
to sort out the small cards into eleven piles 5o 
that they seem to be fairly evenly spaced or 
graded. Only the two ends and the middle pile 
are labelled. ‘The middle pile is indicated for 
neutral opinions. The reader must decide for 
each statement which of five subjective degrees 
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of affirmation or five subjective degrees of nega- 
tion is implied in the statement or whether it 
is a neutral opinion. 

When such sorting has been completed by 
two or three hundred readers, a diagram like 
Figure 2 is prepared. We shall discuss it with 
the scale for pacifism-militarism as an example. 
On the base line of this diagram are repre- 
sented the eleven apparently equal steps of 
the attitude variable. The neutral interval is 
the interval 5 to 6, the most pacifistic interval 
from 0 to 1, and the most militaristic interval 
from 10 to 11. This diagram is fictitious and 
is drawn to show the principle involved. Curve 
A is drawn to show the manner in which one 
of the statements might be classified by the 
three hundred readers. It is not classified by 
anyone below the value of 3, half of the readers 
classify it below the value 6, and all of them 
classify it below the value 9. The scale value 
of the statement is that scale value below 
which just one half of the readers place it. In 
other words, the scale value assigned to the 
statement is so chosen that one half of the 
readers consider it more militaristic and one 
half of them consider it less militaristic than 
the scale value assigned. The numerical calcu- 
lation of the scale value is similar to the ¢al- 
culation of the limen by the phigamma 
hypothesis in psychophysical measurement. 

It will be found that some of the statements 
toward the ends of the scale do not give com- 
plete ogive curves. Thus statement C is incom- 
plete in the fictitious diagram. It behaves as 
though it needed space beyond the arbitrary 
limits of the scale in order to be completed. 
Its scale value may, however, be determined as 
that scale value at which the phi-gamma curve 
through the experimental proportions crosses 
the 50 per cent level, which is at c. Still other 
Statements may be found, such as D, which 


have scale values beyond the arbitrary range 
of the scale. These may be assigned scale val- 
ues by the same process, though less accurately. 

The situation is different at the other end 
of the scale. The statement E has a scale value 
at e, but owing to the limit of the scale at the 
point 11 the experimental proportion will be 
1.00 at that point. If the scale continued be- 
yond the point 11 the proportions would con- 
tinue to rise gradually as indicated by the 
dotted line. The experimental proportions are 
all necessarily 1.00 for the scale value 11, and 
hence these final proportions must be ignored 
in fitting the phi-gamma curves and in the loca- 
tion of the scale values of the statements. 


6. THE VALIDITY OF THE SCALE 


(a) The scale must transcend the group mea- 
sured. One crucial experimental test must be 
applied to our method of measuring attitudes 
before it can be accepted as valid. A measuring 
instrument must not be seriously affected in 
its measuring function by the object of meas- 
urement. To the extent that its measuring 
function is so affected, the validity of the in- 
strument is impaired or limited. If a yardstick 
measured differently because of the fact that 
it was a rug, a picture, or a piece of paper that 
was being measured, then to that extent the 
trustworthiness of that yardstick as a measur- 
ing device would be impaired. Within the 
range of objects for which the measuring in- 
strument is intended, its function must be 
independent of the object of measurement. 

We must ascertain similarly the range of 
applicability of our method of measuring atti- 
tude. It will be noticed that the construction 
and the application of a scale for measuring 
attitude are two different tasks. If the scale is 
to be regarded as valid, the scale values of the 
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statements should not be affected by the opin- 
ions of the people who help to construct it. 
This may turn out to be a severe test in prac- 
tice, but the scaling method must stand such 
a test before it can be accepted as being more 
than a description of the people who construct 
the scale. At any rate, to the extent that the 
present method of scale construction is af- 
fected by the opinions of the readers who help 
to sort out the original statements into a scale, 
to that extent the validity or universality of 
the scale may be challenged. 

Until experimental evidence may be forth- 
coming on this point, we shall make the as- 
sumption that the scale values of the statements 
are independent of the attitude distribution 
of the readers who sort the statements. The 
assumption is, in other words, that two state- 
ments on a prohibition scale will be as easy 
or as difficult to discriminate for people who 
are "wet" as for those who are "dry." Given 
two adjacent statements from such a scale, we 
assume that the proportion of "wets" who say 
that statement a is wetter than statement b 
will be substantially the same as the corre- 
sponding proportion for the same statements 
obtained from a group of "drys" Restating 
the assumption in still another way, we are 
saying that it is just as difficult for a strong 
militarist as it is for a strong pacifist to tell 
which of two statements is the more militaristic 
in attitude. If, say, 85 per cent of the mili- 
tarists declare statement A to be more mili- 
taristic than statement B, then, according to 
our assumption, substantially the same propor- 
tion of pacifists would make the same judg- 
ment. If this assumption is correct, then the 
scale is an instrument independent of the 
attitude which it is itself intended to measure. 

The experimental test for this assumption 
consists merely in constructing two scales for 
the same issue with the same set of statements. 
One of these scales will be constructed on the 
returns from several hundred readers of mili- 
taristic sympathies and the other scale will be 
constructed with the same statements on the 
returns from several hundred pacifists. If the 
scale values of the statement are practically 
the same in the two scales, then the validity of 
the method will be pretty well established. 


* The neutrality point would not necessarily be 
represented by the same statement for both mili- 
tarists and pacifists, but the scale separations be- 
tween all pairs of statements should be practically 
the same for the two conditions of standardization. 


It will still be necessary to use opinion scales 
with some discretion. Queer results might be 
obtained with the prohibition scale, for exam- 
ple, if it were presented in a country in which 
prohibition is not an issue. 

(b) An objective criterion of ambiguity. 
Inspection of the curves in Figure 2 reveals 
that some of the statements of the fictitious 
diagram are more ambiguous than others. The 
degree of ambiguity in a statement is immedi- 
ately apparent, and in fact it can be definitely 
measured. The ambiguity of a statement is the 
standard deviation of the best fitting phi- 
gamma curve through the observed proportions. 
The steeper the curve, the smaller is the range 
of the scale over which it was classified by the 
readers and the clearer and more precise is the 
statement. The more gentle the slope of the 
curve, the more ambiguous is the statement. 
Thus of the two statements A and B in the 
fictitious diagram the statement A is the more 
ambiguous. 

In case it should be found that the phi- 
gamma function does not well describe the 
curves of proportions in Figure 2, the degree 
of ambiguity may be measured without postu- 
lating that the proportions follow the phi- 
gamma function when plotted on the attitude 
scale. A simple method of measuring ambiguity 
would then be to determine the scale distance 
between the scale value at which the curve of 
proportions has an ordinate of .25 and the 
scale value at which the same curve has an 
ordinate of .75. The scale value of the state- 
ment itself can also be defined, without assum- 
ing the phi-gamma function, as that scale value 
at which the curve of proportions reaches .50. 
If no actual proportion is found at that value, 
the scale value of the statement may be inter 
polated between the experimental proportions 
immediately above and below the .50 level. In 
scaling the statements whose scale values fall 
outside the ten divisions of the scale, it will 
be necessary to make some assumption regard: 
ing the nature of the curve, and it will prob- 
ably be found that for most situations the 
phigamma function will constitute a fairly 
close approximation to truth. 

(c) An objective criterion of irrelevance. 
Before a selection of statements can be made 
for the final scale, still another criterion must 
be applied. It is an objective criterion of irrele- 
vance, Referring again to Figure 1, let us con- 
sider two statements that have identical scale 
values at the point f. Suppose, further, that 


these two statements are submitted to the group 
of readers represented in the fictitious diagram 
of Figure 1. It is quite conceivable, and it ac- 
tually does happen, that one of these state- 
ments will be indorsed quite frequently while 
the other statement is only seldom indorsed in 
spite of the fact that they are properly scaled 
as implying the same degree of pacifism or 
militarism. The conclusion is then inevitable 
that the indorsement that a reader gives to 
these statements is determined only partly by 
the degree of pacifism implied and partly by 
other implied meanings which may or may not 
be related to the attitude variable under con- 
sideration. Now it is of course necessary to 
select for the final attitude scale those state- 
ments which are indorsed or rejected primarily 
on account of the degree of pacifism-militarism 
which is implied in them and to eliminate 
those statements which are frequently accepted 
or rejected on account of other more or less 
subtle and irrelevant meanings. 

An objective criterion for accomplishing this 
elimination automatically and without intro- 
ducing the personal equation of the investi- 
gator is available. It is essentially as follows: 
Assume that the whole list of about one hun- 
dred statements has been submitted to several 
hundred readers for actual voting. These need 
not be the same readers who sorted the state- 
ments for the purpose of scaling. Let these 
readers be asked to mark with a plus sign every 
statement which they indorse and to reject 
with a minus sign every statement not to their 
liking. 

If we want to investigate the degree of ir- 
relevance of any particular statement which, 


Relative frequency of endorsement 
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for example, might have a scale value of 4.0 
in Figure 3, we should first of all determine 
how many readers indorsed it. We find, for 
example, that 260 readers indorsed it. Let this 
total be represented on the diagram as 100 
per cent, and erect such an ordinate at the 
scale value of this statement. We may now 
ascertain the proportion of these 260 readers 
who also indorsed each other statement. If the 
readers indorse and reject the statements 
largely on the basis of the degree of pacifism- 
militarism implied, then those readers who in- 
dorse statements in the vicinity of 4.0 on the 
scale will not often indorse statements that 
are very far away from that point on the scale. 
Very few of them should indorse a statement 
which is scaled at the point 8.0, for example. 
If a large proportion of the 260 readers who 
indorse the basic statement scaled at 4.0 should 
also indorse a statement scaled at the point 8.0, 
then we should infer that their voting on these 
two statements has been influenced by factors 
other than the degree of pacifism that is im- 
plied in the statements. We can represent this 
type of analysis graphically. 

Every one of these other statements will be 
represented by a point on this diagram. Its 
x-value will be the scale value of the statement, 
and its y-value will be the proportion of the 
260 readers who indorsed it. Thus, if out of 
the 260 readers who indorsed the basic state- 
ment there were 130 who also indorsed state- 
ment No. 14, which has a scale value of, say, 
5.0, then statement No. 14 will be represented 
at the point 4 on Figure 3. 

If the basic statement, the degree of irrele- 
vance of which is represented in Figure 3, is 
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an ideal statement, one which people will ac- 
cept or reject primarily because of the attitude 
on pacifism which it portrays, then we should 
expect the one hundred statements to be repre- 
sented by as many points hovering more or 
less about the dotted line of Figure 3. The 
diagram may of course be more contracted or 
spread out, but the general appearance of the 
plot should be that of Figure 3. If, on the 
other hand, the basic statement has implica- 
tions that lead to acceptance or rejection quite 
apart from the degree of pacifism which it con- 
veys, then the proportion of the indorsements 
of the statements should not be a continuous 
function of their scale distance from the basic 
statement. The one hundred points might then 
scatter widely over the diagram. This inspec- 
tional criterion of irrelevance is objective and 
it can probably be translated into a more defi- 
nite algebraic form so as to eliminate entirely 
the personal equation of the investigator. 

Two other objective criteria of irrelevance 
have been devised. They will be described in 
connection with the attitude scales now being 
constructed. 


7. SUMMARY OF THE SCALING METHOD 


The selection of the statements for the final 
scale should now be possible. A shorter list of 
twenty or thirty statements should be selected 
for actual use. We have described three criteria 
by which to select the statements for the final 
scale. These criteria are: 


1. The statements in the final scale should 
be so selected that they constitute as nearly as 
possible an evenly graduated series of scale 
values. 

2. By the objective criterion of ambiguity it 
is possible to eliminate those statements which 
project too great a dispersion on the attitude 
continuum. The objective measure of ambigu- 
ity is the standard deviation of the best fitting 
phi-gamma curve as illustrated in Figure 2. 

3. By the objective criteria of irrelevance it 
is possible to eliminate those statements which 
are accepted or rejected largely by factors other 
than the degree of the attitude-variable which 
they portray. One of these criteria is illustrated 
in Figure 3. 


The steps in the construction of an attitude 
scale may be summarized briefly as follows: 


1, Specification of the attitude variable to be 
measured. 


2. Collection of a wide variety of opinions 
relating to the specified attitude variable. 

3. Editing this material for a list of about 
one hundred brief statements of opinion. 

4. Sorting the statements into an imaginary 
scale representing the attitude variable. This 
should be done by about three hundred readers. 

5. Calculation of the scale value of each 
statement. 

6. Elimination of some statements by the 
criterion of ambiguity. 

7. Elimination of some statements by the 
criteria of irrelevance. 

8. Selection of a shorter list of about twenty 
statements evenly graduated along the scale. 


8. MEASUREMENT WITH AN ATTITUDE 
SCALE 


The practical application of the present 
measurement technique consists in presenting 
the final list of about twenty-five statements of 
opinion to the group to be studied with the 
request that they check with plus signs all the 
statements with which they agree and with mi- 
nus signs all the statements with which they 
disagree. The score for each person is the aver- 
age scale value of all the statements that he 
has indorsed. In order that the scale be effec- 
tive toward the extremes, it is advisable that 
the statements in the scale be extended in both 
directions considerably beyond the attitudes 
which will ever be encountered as mean values 
for individuals. When the score has been deter- 
mined for each person by the simple summa- 
tion just indicated, a frequency distribution 
can be plotted for the attitudes of any specified 
group. 

The reliability of the scale can be ascer- 
tained by preparing two parallel forms from 
the same material and by presenting both 
forms to the same individuals. The correlation 
between the two scores obtained for each per- 
son in a group will then indicate the reliability 
of the scale. Since the heterogeneity of the 
group affects the reliability coefficient, it is 
necessary to specify the standard deviation of 
the scores of the group on which the reliability 
coefficient is determined. The standard error of 
an individual score can also be calculated by 
an analogous procedure. 

The unit of measurement in the scale when 
constructed by the procedure here outlined is 
not the standard discriminal error projected 
by a single statement on the psychological con- 


tinuum. Such a unit of measurement can be 
obtained by the direct application of the law 
of comparative judgment, but it is considerably 
more laborious than the method here described. 
The unit in the present scale is a more arbi- 
trary one, namely, one-tenth of the range on 
the psychological continuum which covers the 
span from what the readers regard as extreme 
affirmation to extreme negation in the particu- 
lar list of statements with which we start. Of 
course the scale values can be determined with 
reliability to fractional parts of this unit. It is 
hoped that this unit may be shown experi- 
mentally to be proportional to a more precise 
and more universal unit of measurement such 
as the standard discriminal error of a single 
Statement of opinion. 

It is legitimate to determine a central ten- 
dency for the frequency distribution of atti- 
tudes in a group. Several groups of individuals 
may then be compared as regards the means of 
their respective frequency distributions of atti- 
tudes. The differences between the means of 
several such distributions may be directly com- 
pared because of the fact that a rational base 
line has been established. Such comparisons are 
not possible when attitudes are ascertained 
merely by counting the number of indorsements 
to separate statements whose scale differences 
have not been measured. 

In addition to specifying the mean attitude 
of each of several groups, it is also possible to 
measure their relative heterogeneity with re- 
gard to the issue in question. Thus it will be 
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possible, by means of our present measurement 
methods, to discover for example that one 
group is 1.6 more heterogeneous in its attitudes 
about prohibition than some other group. The 
heterogeneity of a group is indicated perhaps 
best by the standard deviation of the scale 
values of all the opinions that have been in- 
dorsed by the group as a whole rather than by 
the standard deviation of the distribution of 
individual mean scores. Perhaps different terms 
should be adopted for these two types of 
measurement. 

The tolerance which a person reveals on any 
particular issue is also subject to quantitative 
measurement. It is the standard deviation of 
the scale values of the statements that he 
indorses. The maximum possible tolerance is 
of course complete indifference, in which all of 
the statements are indorsed throughout the 
whole range of the scale. 

If it is desired to know which of two forms 
of appeal is the more effective on any particu- 
lar issue, this can be determined by using the 
scale before and after the appeal. The differ- 
ence between the individual scores, before and 
after, can be tabulated and the average shift 
in attitude following any specified form of 
appeal can be measured. 

The essential characteristic of the present 
measurement method is the scale of evenly 
graduated opinions so arranged that equal 
steps or intervals on the scale seem to most 
people to represent equally noticeable shifts 
in attitude, 


11. The Method of Constructing an Attitude Scale 


RENSIS LIKERT 


I. THE SELECTION OF STATEMENTS 


Each statement should be of such a nature 
that persons with different points of view, so 
far as the particular attitude is concerned, will 
respond to it differentially. Any statement to 
which persons with markedly different attitudes 
can respond in the same way is, of course, 
unsatisfactory. 

The results obtained in constructing the 
present scales demonstrate the value of the 
following criteria. These criteria were kept in 
mind in collecting the statements for the origi- 
nal Survey of Opinions. 


l. It is essential that all statements be ex- 
pressions of desired behavior and not state- 
ments of fact. Two persons with decidedly 
different attitudes may, nevertheless, agree on 
questions of fact. Consequently, their reaction 
to a statement of fact is no indication of their 
attitudes. For example, a person strongly pro- 
Japanese and a person strongly pro-Chinese 
might both agree with the following state- 
ments: 


"The League of Nations has failed in prevent- 
ing Japan's military occupation of Manchuria." 


or 


"Japan has been trying to create in Manchuria 
à monopoly of trade, equivalent to closing the 
'open-door' to the trade of other countries." 


To agree with them or believe them true is in 
no way a measure of attitude. 
Rice (27, p. 184) has clearly stated the im- 


* Excerpted from the Appendix of "A Technique 
for the Measurement of Attitudes," Archives of 
Psychology, 1932, No. 140, 44-58, with the permis- 


sion of the author and the Columbia University 
Press. 


portance of recognizing this criterion when in 
discussing the Thurstone technique he says: 


What is the possibility that the acceptance or 
rejection by a subject of a statement upon the 
completed scale may represent a rational judg- 
ment concerning the truth or falsity of the 
statement made? It would seem to exist. If so, 
the validity of the statement as an index of 
attitude is destroyed or impaired. 

In dealing with expressions of desired be- 
havior rather than expressions of faci the state- 
ment measures the present attitude of the 
subject and not some past attitude. The impor- 
tance of dealing with present rather than past 
attitudes has been emphasized by Thurstone 
(38) and Murphy (22, p. 615). A very conve- 
nient way of stating a proposition so that it does 
involve desired behavior is by using the term 
should. 

2. The second criterion is the necessity of 
stating each proposition in clear, concise, 
straight-forward statements, Each statement 
should be in the simplest possible vocabulary. 
No statement should involve double negatives 
or other wording which will make it involved 
and confusing. Double-barreled statements are 
most confusing and should always be broken 
in two. Often an individual wishes to react 
favorably to one part and unfavorably to the 
other and when the parts are together he is 
at a loss to know how to react. Thus in the 
following illustration a person might well ap- 
prove one part and disapprove another part: 


"In order to preserve peace, the United States 
should abolish tariffs, enter the League of Na- 
tions, and maintain the largest army and navy 
in the world." 


To ask for a single response to this statement 
makes it meaningless to the subject. This state 
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ment should be divided into at least three 
separate statements. 

The simplicity of the vocabulary will, of 
course, vary with the group upon whom the 
scale is intended to be used, but it is a desira- 
ble precaution to state each proposition in 
such a way that persons of less understanding 
than any member of the group for which the 
test is being constructed will understand and 
be able to respond to the statements. Above 
all, regardless of the simplicity or complexity 
of vocabulary or the naiveté or sophistication 
of the group, each statement must avoid every 
kind of ambiguity. 

3. In general it would seem desirable to 
have the questions so worded that the modal 
reaction to some is more toward one end of 
the attitude continuum and to others more in 
the middle or toward the other end. In this 
manner, the modal value of the responses to 
the different questions in the attitude scale 
will range fairly well across the attitude con- 
tinuum. There is no need, however, to have 
questions whose modal reactions are at either 
extreme of the continuum. 

4. To avoid any space error or any tendency 
to a stereotyped response it seems desirable to 
have the different statements so worded that 
about one-half of them have one end of the 
attitude continuum corresponding to the left 
or upper part of the reaction alternatives and 
the other half have the same end of the atti- 
tude continuum corresponding to the right or 
lower part of the reaction alternatives. For 
example, about one-half the statements in the 
Internationalism scale have the international 
extreme corresponding with "Strongly approve" 
while the other half has it corresponding with 
"Strongly disapprove.” These two kinds of 
statements ought to be distributed throughout 
the attitude test in a chance or haphazard 
manner. 

5. If multiple choice statements are used, 
the different alternatives should involve only 
a single attitude variable and not several. 


II. CONSTRUCTING THE SCALE 


It is usually desirable to prepare and select 
More statements than are likely to be finally 
used, because after trying the statements upon 
à group, some may be found to be quite un- 
Satisfactory for the intended purpose. For this 
reason after selecting a good number of state- 
ments they should be given to the group or a 


part of the group whose attitudes we wish to 
measure. The sample used should be sufficiently 
large for statistical purposes. 

For purposes of tabulation and scoring, a 
numerical value must be assigned to each of 
the possible alternatives. If five alternatives 
have been used, it is necessary to assign values 
of from one to five with the three assigned to 
the undecided position on each statement. The 
ONE end is assigned to one extreme of the 
attitude continuum and the FIVE to the other; 
this should be done consistently for each of the 
statements which it is expected will be in- 
cluded in the scale. Thus if we arbitrarily con- 
sider the “favorable to the Negro” extreme 
FIVE and the “unfavorable to the Negro” ex- 
treme ONE, the alternative responses to the 
following statements would be assigned the 
values shown in Table 1. 

Some may object to the designation made, 
saying that the terms “favorable” and “unfavor- 
able” are ambiguous or that the favorable atti- 
tude is just opposite to that here considered 
favorable. 

So far as the measurement of the attitude is 
concerned, it is quite immaterial what the ex- 
tremes of the attitude continuum are called; 
the important fact is that persons do differ 
quantitatively in their attitudes, some being 
more toward one extreme, some more toward 
the other. Thus, as Thurstone has pointed out 
in the use of his scales, it makes no difference 
whether the zero extreme is assigned to “appre- 
ciation of" the church or “depreciation of" the 
church, the attitude can be measured in either 
case and the person’s reaction to the church 
expressed. 

The split-half reliability should be found by 
correlating the sum of the odd statements for 
each individual against the sum of the even 
statements. Since each statement is answered 
by each individual, calculations can be reduced 
by using the sum rather than the average. 

An objective check ought then to be applied 
to see (1) if the numerical values are properly 
assigned and (2) whether the statements are 
“differentiating.” One possible check is item 
analysis which calls for calculating the corre- 
lation coefficient of each statement with the 
battery. If a negative correlation coefficient is 
obtained, it indicates that the numerical values 
are not properly assigned and that the ONE 
and FIVE ends should be reversed. If a zero 
or very low correlation coefficient is obtained, 
it indicates that the statement fails to measure 
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TABLE 1 
Numerical “How far in our educational system (aside from trade edu- 
Value cation) should the most intelligent Negroes be allowed to go? 
1 (a) Grade school. 
2 (b) Junior high school. 
3 (c) High school. 
4 (d) College. 
5 (e) Graduate and professional schools.” 
“In a community where the Negroes outnumber the whites, a 
Negro who is insolent to a white man should be: 
5 (a) Excused or ignored. 
4 (b) Reprimanded. 
3 (c) Fined and jailed. 
2 (d) Not only fined and jailed, but also given corporal 
punishment (whipping, etc.). 
1 (e) Lynched.” 
“All Negroes belong in one class and should be treated in 
about the same way.” 
STRONGLY STRONGLY 
APPROVE APPROVE UNDECIDED DISAPPROVE DISAPPROVE 
Value (1) (2) (8) (4) (5) 
"Where there is segregation, the Negro section should have 
the same equipment in paving, water, and electric light 
facilities as are found in the white districts." 
STRONGLY STRONGLY 
APPROVE APPROVE UNDECIDED DISAPPROVE DISAPPROVE 
Value (5) (4) (3) (2) (1) 


that which the rest of the statements measure. 
Such statements will be called undifferentiat- 
ing. Thurstone (38) refers to them as irrele- 
vant or ambiguous. By "undifferentiating" we 
merely mean that the statement does not mea- 
sure what the battery measures and hence to 
include it contributes nothing to the scale. A 
statement which is undifferentiating for a scale 
measuring one attitude continuum may be 
quite satisfactory for a scale measuring another 
attitude continuum. The following are some 
of the reasons why a statement may prove 
undifferentiating: 


1, The statement may involve a different 
issue from the one involved in the rest of the 
statements, that is, it involves a different atti- 
tude continuum. 

2. The statement may be responded to in 
the same way by practically the entire group. 
For example, the response to the following 
statement was practically the same upon the 
part of all students—some two thousand—to 


whom it was given: "Should the United States 
repeal the Japanese Exclusion Act?" 

3. The statement may be so expressed that 
it is misunderstood by members of the group. 
This may be due to its being poorly stated, 
phrased in unfamiliar words, or worded in the 
form of a double-barreled statement. 

4. It may be a statement concerning fact 
which individuals who fall at different points 
on the attitude continuum will be equally lia- 
ble to accept or reject. 


It is, of course, desirable in constructing an 
attitude scale that the experimenter exercise 
every precaution in the selecting of statements 
so as to avoid those that are undifferentiating. 
However, item analysis can be used as an ob- 
jective check to determine whether the mem- 
bers of a group react differentially to the 
statement in the same manner that they react 
differentially to the battery; that is, item analy- 
sis indicates whether those persons who fall 
toward one end of the attitude continuum on 
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the battery do so on the particular statement 
and vice versa. Thus item analysis reveals the 
satisfactoriness of any statement so far as its 
inclusion in a given attitude scale is concerned. 

No matter for what a priori reasons the 
experimenter may consider a statement to be- 
long in a scale, if the statement, when tried 
on a group, does not measure what the rest 
of the statements measure, there is no justifi- 
cation for keeping that statement in the bat- 
tery. After all, we are interested in measuring 
the attitudes of the members of the group, 
not those of the experimenter. 

There is no reason to expect that the logical 
analysis of the person who selects the state- 
ments will necessarily be supported by the 
group. Quite often, because of a lack of under- 
standing of the cultural background of the 
group, the experimenter may find that the state- 
ments do not form the clusters or hierarchies 
that he expected. It is as important psycho- 


logically to know what these clusters are as it 
is to be able to measure them. 

The degree of inclusion, i.e. the size of the 
correlation coefficient between the item and 
the battery, required for a particular statement 
will no doubt be a function of the purpose for 
which the attitudes are being measured. If a 
general survey type of study is being under- 
taken the degree of inclusion required will be 
less than when a more specialized aspect of 
attitudes is being studied. A similar relation- 
ship is to be noted in the measurement of 
intelligence, 

The only difficulty in using item analysis is 
that the calculation of the necessary coefficients 
of correlation is quite laborious. The criterion 
of internal consistency was tried and the results 
obtained were found to be comparable with 
the results from item analysis. Table 2 shows 
a comparison of the results obtained from item 
analysis and the criterion of internal consis- 


TABLE 2. Comparison of the Results Obtained from the Application of 
the Criterion of Internal Consistency and Item Analysis to the 
Negro Scale for Groups “A” and “B” Combined (N = 62) 


Column 1 Column 2 Column 3 Column 4 Column 5 
1 .69 1.7 2 5 
2 64 1.5 6 6 
3 51 1.7 10 11 
4 .18 0.4 14 14 
5 62 1.3 7 8 
6 40 0.7 1 18 
7 12 0.1 15 15 
8 39 1.1 12 10 
9 26 0.9 13 12 

10 65 2 5 1 
11 .60 1.2 8 9 
12 54 1.4 9 7 
13 67 2.3 4 2 
14 74 2.0 1 3 
15 68 1.6 3 4 


rho (Column 4 vs. Column 5) = +.91) 


Column 1—Statement numbers. 


Column 2—Coefficient of correlation between the score on the individual 
statement and the average score on all fifteen statements. 

Column 8—Difference between the average score of the highest 9 individuals 
and the lowest 9 individuals. 

Column 4—Order of excellence as determined by item analysis based upon 
the coefficients of correlation shown in Column 2. 

Column 5—Order of excellence as determined by the criterion of internal 
consistency based upon the differences shown in Column 3. 


TABLE 3. Criterion of Internal Consistency Applied to the Internationalism Scale for Group “D” (N = 100) 


STATEMENT NUMBERS 


Three-Point Statements 


Five-Point Statements 


HIGH GROUP 
Individual 
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tency. It will be noted that the relation be- 
tween the order of excellence for the different 
statements as determined by item analysis and 
the criterion of internal consistency as ex- 
pressed by rho is + .91. Since the criterion of 
internal consistemcy is much easier to use than 
item analysis and yet yields essentially the same 
results, its use is suggested. 

In using the criterion of internal consistency 
the reactions of the group that constitute one 
extreme in the particular attitude being mea- 
sured are compared with the reactions of the 
group that constitute the other extreme. In 
practice approximately ten per cent from each 
extreme was used. Table 3 shows the criterion 
of internal consistency applied to the Inter- 
nationalism scale for Group D. This criterion 
acts as an objective check upon the correct 
assigning of numerical values in that if the 
numerical values are reversed on a particular 
statement the extreme high group will score 
low on that statement and the extreme low 
group will score high, i.e. we will obtain a nega- 
tive difference between the two extreme groups 
on that question. Furthermore, if a statement 
is undifferentiating it will not differentiate or 
discriminate the two extreme groups, i.e., the 
high group will not score appreciably higher 
than the low group upon that statement. 

Finally, on the basis of the results obtained 
from item analysis or the criterion of internal 
consistency and having due regard for all the 
factors concerned, one should select the most 
differentiating statements for the final form or 
forms of the attitude test. If, through this 
selection of the more differentiating state- 
ments, statements concerning a particular as- 
pect of the attitude being measured are elimi- 
nated, then, obviously, the final scale can only 
be said to measure the attitude continuum rep- 
resented by the remaining statements. For 
example, if it is found by the use of these 
objective checks that statements concerning the 
economic status of the Negro involve an atti- 
tude continuum other than that of statements 
having to do with the social equality of the 
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12. A Basis for Scaling Qualitative Data 


LOUIS GUTTMAN 


1. INTRODUCTION 


In a great deal of research in the social and 
psychological sciences, interest lies in certain 
large classes of qualitative observations. For ex- 
ample, research in marriage is concerned with 
a class of qualitative behavior called marital ad- 
justment, which includes an indefinitely large 
number of interactions between husband and 
wife. Public opinion research is concerned with 
large classes of behavior like expressions of 
opinion by Americans about the fighting ability 
of the British. Educational psychology deals 
with large classes of behavior like achievement 
tests. 

It is often desired in such areas to be able to 
Summarize data by saying, for example, that one 
marital couple is better adjusted than another 
marital couple, or that one person has a better 
opinion of the British than has another person, 
or that one student has a greater knowledge of 
arithmetic than has another student. There has 
been considerable discussion concerning the 
utility of such orderings of persons. It is not our 
intention in this paper to review such discus- 
sions, but instead to present a rather new ap- 
proach to the problem which seems to afford an 
adequate basis for quantifying qualitative data. 

This approach has been used successfully for 
the past year or so in investigating morale and 
other problems in the United States Army by 
the Research Branch of the Morale Services 
Division of the Army Service Forces. While this 
approach to quantification leads to some inter- 
esting mathematics, no knowledge of this math- 
ematics is required in actually analyzing data. 
Simple routines have been established which 
require no knowledge of statistics, which take 
less time than the various manipulations now 


* Reprinted from American Sociological Review, 
1944, 9, 139-150, with permission of the author 
and the American Sociological Association. 


used by various investigators (such as critical 
ratios, biserial correlations, factor analysis, etc.), 
and which give a complete picture of the data 
not afforded by these other techniques. The 
word “picture” might be interpreted here liter- 
ally, for the results of the analysis are presented 
and easily assimilated in the form of a “scalo- 
gram,” which at a glance gives the configuration 
of the qualitative data. 

Description of the practical procedures, as 
well as the mathematical analysis, must be post- 
poned to other papers. The present paper is 
devoted to a non-technical discussion of what 
we mean by a scale. 


2. THE NOTIONS OF VARIABLE, FUNCTION, 
AND SIMPLE FUNCTION 


First, a word about what is meant by a vari- 
able, whether qualitative or quantitative. We 
use the term in its conventional logical or 
mathematical sense, as denoting a set of values. 
These values may be numerical (quantitative) 
or non-numerical (qualitative). We shall use 


1 In conventional courses in undergraduate college 
mathematics it is not ordinarily pointed out that 
a great deal of mathematics deals with purely 
qualitative variables. Notions of metrics and quan- 
titative variables can be arrived at by sequences 
of qualitative classifications, In fact, this is the 
manner in which our approach to scaling derives 
a scale ordering. 

The reader who is interested might look at a 
recent departure in textbooks for an introductory 
course in college mathematics (M. Richardson, 
Fundamentals of Mathematics, Macmillan, 1941). 
This book gives a simple, entertaining, and mature 
introduction to the foundations of mathematics. 
Its emphasis is on understanding, rather than on 
manipulation. It covers fundamental topics like 
point sets, the concept of number, and others that 
are rarely mentioned in ordinary undergraduate 
curricula and yet are mainstays of mathematical 
theory. It is only that most of us have been cx- 
posed exclusively to certain algebraic manipula- 
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the term “attribute” interchangeably with 
“qualitative variable.” The values of an attri- 
bute (or of a quantitative variable, too, for that 
matter) may be called its subcategories, or sim- 
ply categories. 

An example of an attribute is religion. A 
person may have the value “Catholic,” “Bud- 
dhist," "Jewish," “Mormon,” “atheist,” or some 
other value of this variable. There is no par- 
ticular intrinsic ordering among these values. 
Another example is expression of an opinion. 
A person may say, “I like the British,” “I don’t 
like the British,” or “I don't know whether or 
not I like the British.” Another example is, a 
person may be observed to smile at another 
person upon meeting him, or he may be ob- 
served not to smile. 

Quantitative variables are readily recognized 
and need no discussion here. 

A variable y is said to be a single-valued func- 
tion of a variable x if to each value of x there 
corresponds a single value of y. Thus, if y has 
the distinct values yj, Yo, . . . , Ym» and if x 
has the distinct values, x4, X, . . . , X, where 
m and n may be different, y is called a single- 
valued function of x if a table of correspon- 
dence can be set up like, for example, the 
following: 


DODAL 
DODANA 


For each value of x there is one and only one 
value of y. (The converse need not hold: for 
the same value of y there may be two or more 
values of x.) Obviously, if y is to be a single- 
valued function of x, then we must have m € 
n. 

In particular, suppose y is an attribute, say 
like the above attribute about expression of lik- 
ing for the British. Then m —3, and we may 
denote by y, the statement, "I like the British"; 
by yo, the statement, “I don't like the British"; 
and by yg, “I don't know whether or not I like 
the British." If x is a quantitative variable 
which takes on more than m values (n > m), and 
if we can divide the x values into m intervals 


tions that we conceive such manipulations to be 
the essence of mathematics. A more sophisticated 
view is to regard mathematics as unveiling neces- 
sary relationships that arise from classifications. 
Much useless discussion of mathematics as a “tool” 
in social research could be saved by recognition of 
the fact that qualitative classifications lead to just 
as rigorous implications as do quantitative. 


which will have a one-to-one correspondence 
with the values of y, then we shall say the attri- 
bute y is a simple function of x. For example, 
suppose x takes on the ten values 0, 1, 2, 3, 4, 
5, 6, 7, 8, 9. Then the correspondence table 
might be as follows: . 


HUOBBBOBBBUUU 


Or we might show this graphically by plotting 
the x values on a straight line, and cutting it 
into intervals, as in Figure 1. For statistical 


SEG X WU kc 8 9 
Figure 1 


variables, another representation is in terms of 
a bar chart of frequencies, and this is what we 
use for convenience in $10 and $11 below. 


3. THE DEFINITION OF SCALE 


For a given population of objects, the multi- 
variate frequency distribution of a universe of 
attributes will be called a scale if it is possible 
to derive from the distribution a quantitative 
variable with which to characterize the objects 
such that each attribute is a simple function of 
that quantitative variable. Such a quantitative 
variable is called a scale variable. 

Perfect scales are not to be expected in prac- 
tice. The deviation from perfection is measured 
by a coefficient of reproducibility, which is sim- 
ply the empirical relative frequency with which 
the values of the attributes do correspond to the 
proper intervals of a quantitative variable. In 
practice, 85 percent perfect scales or better have 
been used as efficient approximations to perfect 
scales. 

A value of a scale variable will be called a 
scale score, or simply a score. The ordering of 
objects according to the numerical order of 
their scale scores will be called their scale order. 

Obviously, any quantitative variable that is 
an increasing (or decreasing) function of a scale 
variable is also a scale variable. For example, in 
the illustration in 82, consider x to be a scale 
variable. Any constant could be subtracted from 
or added to each of the x scores, and y would 
remain a simple function of the transformed x. 
Thus, the scores 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 could 
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be replaced by the respective scores —5, —4, 
GaP —1; 0; 1, 2,8, 4. Or the x) scores 
could be multiplied by any constant, or their 
square roots or logarithms could be taken—any 
transformation, continuous or discontinuous, 
-could be used, as long as the rank order corre- 
lation between the original x and the trans- 
formed variable remained perfect All such 
transformations will yield scale variables, each 
of which is equally good at reproducing the 
attributes. 

Therefore, the problem of metric is of no 
particular importance here for scaling. For cer- 
tain problems like predicting outside variables 
from the universe of attributes, it may be con- 
venient to adopt a particular metric like a least 
squares metric, which has convenient properties 
for helping analyze multiple correlations. The 
interesting mathematics involved here will be 
discussed in another paper. However, it must 
be stressed that such a choice of metric is a 
matter of convenience; any metric will predict 
an outside variable as accurately as will any 
other. 

In practice, the rank order has been used as 
a scale variable. (It is in fact a least squares 
metric for a rectangular distribution of scale 
scores.) 


4. THE UNIVERSE OF ATTRIBUTES? 


A basic concept of the theory of scales is that 
of the universe of attributes. In social research, 
à universe is usually a large class of behavior 
such as described in the introduction above. 
The universe is the concept whose scalability is 
being investigated, like marital adjustment, 
opinion of British fighting ability, knowledge of 
arithmetic, etc. The universe consists of all the 
attributes that define the concept. Another way 
of describing the universe is to say it consists 
of all the attributes of interest to the investiga- 
tion which have a common content, so that they 
are classified under a single heading which in- 
dicates that content. 

For ease in focusing, let us take an example 
from opinion research where it is desired to 


? The words population and universe are ordi- 
narily used interchangeably in statistical literature. 
For scales, it is necessary to refer both to a com- 
plete set of objects and to a complete set of 
attributes, so it will be convenient to reserve popu- 
lation for the former, and universe for the latter. 
In social research, the objects are usually people, 
so that population is appropriate for them. 


observe the population of individuals in a stan- 
dardized manner by a checklist of questions. 
The behavior of interest to the investigation is 
responses of individuals to such questions. Sup- 
pose the universe of attributes consists of all 
possible questions which could be asked in such 
a list concerning the fighting ability of the Brit- 
ish. Such questions might be: "Do you think 
the British Army is as tough as the German 
Army?"; "Do you think the R.A.F. is superior 
to the Luftwaffe?"; etc. (We do not pause here 
for problems of wording, interpretation, and 
the like. The reader is urged rather to focus on 
the general outline we are trying to establish.) 
There may be an indefinitely large number of 
such questions which belong in the universe; 
and in a particular investigation, ordinarily 
only a sample of the universe is used. 

An attribute belongs to the universe by virtue 
of its content. The investigator indicates the 
content of interest by the title he chooses for 
the universe, and all attributes with that con- 
tent belong in the universe. There will, of 
course, arise borderline cases in practice where 
it will be hard to decide whether or not an 
item belongs in the universe. The evaluation of 
the content thus far remains a matter that may 
be decided by consensus of judges or by some 
other means. This has been recognized before, 
although it need not be regarded as a "sin 
against the Holy Ghost of pure operational- 
ism."? It may well be that the formal analysis 
for scalability may help clarify uncertain areas 
of content. However, we have found it most 
useful at present to utilize informal experience 
and consensus to the fullest extent in defining 
the universe. 

An important emphasis of our present ap- 
proach is that a criterion for an attribute to 
belong in the universe is not the magnitude of 
the correlations of that item with other attri- 
butes known to belong in the universe. It will 
be seen (in 810 below) that attributes of the 
same type of content may have any size of in- 
tercorrelations, varying from practically zero to 
unity.* 


3 Clifford Kirkpatrick, "A Methodological Analysis 
of Feminism in Relation to Marital Adjustment, 
American Sociological Review, June 1939, 4:325- 
334. 

4 That correlations are no criterion for content 
has been quite well known. See, for example, R. F. 
Sletto, Construction of Scales by the Criterion of 
Internal Consistency, Sociological Press, Hanover, 
N.H., 1937. 
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5. THE POPULATION OF OBJECTS 


Defining the universe of attributes is a prob- 
lem similar to the standard problem of defining 
the population of objects or individuals® of in- 
terest to the investigation. An investigator must 
always delimit the population with which he is 
working. For example, in the case of opinion 
about the British as fighters, he must decide 
whose opinions he wishes to ascertain. Is he in- 
terested in everyone in the world, or just in 
everyone in the United States? Is he interested 
in everyone in the United States, or just in 
adults? If just in adults, how is an adult to be 
defined? Here, too, decisions will sometimes be 
difficult as to whether a particular individual 
belongs in a population or not, and decisions 
must be made somehow before the investigation 
begins, else the investigator will not know 
whom to observe. 


6. METHODS OF OBSERVATION 


Let us assume that somehow we have a uni- 
verse of attributes and a population of indi- 
viduals defined. Next, observations are made as 
to the behavior of the population with respect 
to the universe. (In practice this will often be 
done only with samples. A sample of individ- 
uals from the population will have their behav- 
ior observed on a sample of attributes from the 
universe.) How the observations are to be made 
is of no concern to us here. In opinion research 
and other fields, questionnaires and schedules 
have been used, But any technique of observa- 
tion which yields the data of interest to the in- 
vestigation may be used. Such techniques for 
the social and psychological sciences might be 
case histories, interviews, introspection, and any 
other technique from which observations may 
be recorded. The important thing is not how 
the observations were obtained, but that the 
observations be of central interest to the inves- 
tigation. 

Use of a questionnaire implies that the in- 
vestigator is interested in a certain type of 
universe of verbal behavior. Participant observa- 
tion may imply that the investigator is inter- 
ested in a certain type of universe of non-verbal 
behavior. Such distinct universes may each be 
investigated separately. It may often be of in- 
terest to see how well one universe correlates 


5 For convenience, since the examples in this pa- 
per concern populations of human beings, we shall 
talk entirely in terms of such populations. 


with another, but such a correlation cannot be 
investigated until each universe is defined and 
observed in its own right. 

The examples of scales to be given later in 
this paper happen to comprise observations 
made by means of questionnaires. It should not 
be inferred, however, that scaling refers only to 
that technique. Scaling analysis is a formal 
analysis, and hence applies to any universe of 
qualitative data of any science, obtained by any 
manner of observation. 


7. THE PURPOSE OF SCALING 


Obviously it is very clumsy to record the large 
number of observations ordinarily involved in 
a universe of attributes for a population of in- 
dividuals. The recording requires a table with 
one row for each individual and one column 
for each attribute. (The table may theoretically 
be indefinitely large.) It would be convenient 
if we could represent the observations in a more 
compact manner which would enable us to re- 
produce such a table whenever desired. A com- 
pact representation, if it could be obtained, 
would have two great advantages: first, a mne- 
monic advantage, for a compact representation 
would be easier to remember than would be a 
large table; and second, if it were desired to 
relate the universe to other variables it would 
be easier to do so by means of the compact 
representation than by using the large multi- 
variate distribution of the attributes in the uni- 
verse. From these are derived other advantages 
which will become apparent as the reader's 
familiarity with scales grows. 

A particularly simple representation of the 
data. would be to assign to each individual a 
numerical value and to each category of each 
attribute a numerical value such that, given the 
value of the individual and the values of the 
categories of an attribute, we could reproduce 
the observations of the individual on the attri- 
bute. This will be possible only for restricted 
types of data, where each attribute in the uni- 
verse can be expressed as a simple function of 
the same quantitative variable, that is, where 
the universe of attributes forms a scale for the 
population of individuals. 


8. AN EXAMPLE OF A DICHOTOMOUS 
SCALE 


As may be expected, the universe of attributes 
must form a rather specialized configuration for 
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the population of individuals if it is to be scal- 
able. Before describing a more general case, let 
us give a little example. (A sociological interpre- 
tation of this apparently mathematical example 
is given in 815 below.) Consider a mathematics 
test composed of the following problems: 


(a) If r is the radius of a circle, then what is 
its area? 


(b) What are the values of x satisfying the 
equation 
ax? + bx --c— 0? 


(c) What is de®/dx? 


If this test were given to the population of 
members of the American Sociological Society, 
we would perhaps find it to form a scale for 
that population. The responses to each of these 
questions might be reported as a dichotomy, 
right or wrong. There are 2x 2x 2 — 8 possi- 
ble types for three dichotomies. Actually, for 
this population of sociologists we would prob- 
ably find only four of the possible types occur- 
ring. There would be the type which would get 
all three questions right, the type which would 
get the first and second questions right, the type 
which would get only the first question right, 
and the type which would get none of the ques- 
tions right. Let us assume that this is what 
would actually happen. That is, we shall assume 
the other four types, such as the type getting 
the first and the third questions right but the 
second question wrong, would not occur. In 
such a case, it is possible to assign to the popu- 
lation a set of numerical values like 3, 2, 1, 0. 
Each member of the population will have one 
of these values assigned to him. This numerical 
value will be called the person's score. From a 
person's score we would then know precisely to 
which problems he knows the answers and to 
which he does not know the answer. Thus a 
score of 2 does not mean simply that the person 
got two questions right, but that he got two 
particular questions right, namely, the first and 
second. A person's behavior on the problems is 
reproducible from his score. More specifically, 
each question is a simple function of the score, 
as is shown in 810 below. 


9. THE MEANING OF "MORE" AND "LESS" 


Notice that there is a very definite meaning 
to saying that one person knows more mathe- 
matics than another with respect to this sam- 
ple. For example, a score of $ means more than 


a score of 2 because the person with a score of 
3 knows everything a person with a score of 2 
does, and more. 

There is also a definite meaning to saying 
that getting a question right indicates more 
knowledge than getting the same question 
wrong, the importance of which may not be too 
obvious. People who get a question right all 
have higher scale scores than do people who 
get the question wrong. As a matter of fact, we 
need no knowledge of which is a right answer 
and which is a wrong answer beforehand to es- 
tablish a proper order among the individuals. 
For convenience, suppose the questions were 
given in a "true-false" form,® with suggested an- 
swers 2IIr, (— b + vV b? — 4ac) | 2a, and xe?—1 
for the respective questions. Each person re- 
cords either a T or an F after each question, 
according as he believes the suggested answers 
to be true or false. If the responses of the pop- 
ulation form a scale, then we do not have to 
know which are the correct answers in order to 
rank the respondents (only we will not know 
whether we are ranking them from high to low 
or from low to high) By the scale analysis, 
which essentially is based on sorting out the 
joint occurrences of the three items simultane- 
ously, we would find only 4 types of persons 
occurring. One type would be F,T,F,, where 
the subscripts indicate the questions; that is, 
this type says F to question 1, T to question 2, 
and F to question 3. The other three types 
would be FT T, F;F;T;, and T,F;T;. These 
types could be shown in a chart (a “scalogram”) 
where there is one row for each type of person 
and one column for each category of each attri- 
bute. Without going into details, the scale anal- 
ysis would establish an order among the rows 
and among the columns which would finally 
look as shown in Figure 2. 


F T,F, T,F,T,; 
vvv 
Vv V 
vvv 
vvv 


Figure 2 


Or, alternatively, both rows and columns 
might be completely reversed in order. Each 


6 We shall assume that no one gets an answer 
right by guessing. In a later paper it will be shown 
how scale analysis can actually pick out responses 
that were correct merely by guessing. But for this, 
much more than three items are necessary. 
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response to a question is indicated by a check 
mark. Each row has three checkmarks because 
each question is answered, either correctly or 
incorrectly. The “parallelogram” pattern in the 
chart? is necessary and sufficient for a set of di- 
chotomous attributes to be expressible as simple 
functions of a single quantitative variable. 
From this chart we can deduce that F}, T», 
and F, are all correct answers, or are all incor- 
rect answers. That is, if we were now told that 
F, is a correct answer, we would immediately 
know that T, and Fy are also correct answers. 
This means that we can order the men accord- 
ing to their knowledge even if we do not know 
which are the correct answers and which are 
the incorrect answers, only we do not know 
whether we are ordering them from highest to 
lowest or from lowest to highest. Except for di- 
rection, the ordering is a purely formal conse- 
quence of the configuration of the behavior of 
the population with respect to the items. The 
importance of this fact will become more ap- 
parent in more complicated cases where the 
attributes are not dichotomous but have more 
than two categories. We do not take the space 
here to expand on this point, but merely state 
that the scale analysis automatically decides, for 
example, where an "undecided" response to a 
public opinion poll questionnaire belongs, 
whether it is above "yes," below "no," in be- 
tween, equivalent to “yes,” or equivalent to 


“ D 


no." 


10. THE BAR CHART REPRESENTATION 


Another way of picturing the dichotomous 
scale of the sample of three items would be as 
follows: suppose that 80 percent of the popula- 
tion got the first question right, 40 percent got 
the second question right, and 10 percent got 
the third question right. The univariate distri- 
butions of the three respective items could be 
shown by the bar chart in Figure 3. 

The bars show the percentage distributions 
for the respective questions. The multivariate 
distribution for the three questions, given that 
they form a scale for the population, can also 
be indicated on the same chart, since all those 
who are included in the group getting a harder 
question right are also included in the group 


7 Such a chart, where one column is used for each 
category of each attribute, we call a scalogram. 'The 


scalogram boards used in practical procedures are 
simply devices for shifting rows and columns to 
find a scale pattern if it exists. 


getting an easier question right. Thus, we could 
draw the bar chart over again, but connect the 
bars with dashed lines in the fashion shown in 
Figure 4. Here we can see how the three ques- 


(a) 


Frequency 20% 40% 30% 105 
Score 0 1 2 3 
Figure 4 


tions are simple functions of the scores. From 
the marginal frequencies of the separate items, 
together with the fact that the items form a 
scale, we are enabled to deduce that 10 percent 
of the people got a score of 3. The 10 percent 
who got the hardest question right are included 
in those who got the easier questions right. 
This is indicated by the dashed line on the 
right, between the scores 2 and 3, which carries 
the same 10 percent of the people (those with . 
a score of 3) through the three bars. The 40 
percent who got the second question right in- 
clude the 10 percent who got the hardest ques- 
tion right and 30 percent out of those who got 
the hardest question wrong, but all 40 percent 
got the easiest question right. This leaves us 30 
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percent who got just the first and second ques- 
tions right. And so on. Thus we can think of 
an ordering of the persons along a horizontal 
axis, and each item can be thought of as a cut 
on that axis. All those above the cutting point 
get the question right and all those below the 
cutting point get the question wrong. Thus 
there is a one-to-one correspondence between 
the categories of an item and segments of the 
axis. Or we can say that each attribute is a 
simple function of the rank order along the 
axis. 
It is because all the items in the sample can 
be expressed as simple functions of the same 
_ ordering of persons that they form a scale. Each 
item is perfectly correlated with or reproducible 
from the ordering along the axis. However, the 
point correlations between the items are not at 


all perfect. For example, the four-fold table i 


between the second and third items is as shown 
in the table. 


Question (b) 
Right Wrong 


Right 10 
Question (c) 


Wrong 90 


40 60 100 


The point correlation between the two items is 
A41. As a matter of fact, the point correlation 
between two dichotomous items may be any- 
thing from practically zero to unity, and yet 
they may both be perfect functions of the same 
quantitative variable. That this may be para- 
doxical might be explained by inadequate treat- 
ment of qualitative variables in conventional 
courses and textbooks on statistics. 


8 Technical Footnote. A tetrachoric coefficient for 
the four-fold table above, assuming a bivariate 
normal distribution, would be unity, However, 
this is not the correlation between the items. It 
does not tell how well one can predict one item 
from the other. The tetrachoric coefficient ex- 
Presses instead the correlation between two quan- 
titative variables of which the items are functions, 
provided the assumptions of normality are true. 
The reason the tetrachoric is unity in this case is 
that the quantitative variables of which the items 
are functions are one and the same variable, 


An important feature of this four-fold table 
is the zero frequency in the upper right-hand 
corner cell. Nobody who got the third question 
right got the second question wrong. Such a zero 
cell must always occur in a four-fold table be- 
tween two dichotomous items which are simple 
functions of the same quantitative variable. 


11. ANOTHER EXAMPLE OF A SCALE 


Now let us give an example of a more com- 
plicated scale. Suppose we were interested in 
finding out how much desire soldiers may ex- 
press now about going back to school after the 
war is over. Suppose that out of the universe of 
attributes which define this desire we select the 
following sample of four questions to be pre- 
sented on a questionnaire. 


1. If you were offered a good job, what would 
you do? 
(a) I would take the job 
(b) I would turn it down if the govern- 
ment would help me go to school 
(c) I would turn it down and go back to 
school regardless 
2. If you were offered some kind of job, but 
not a good one, what would you do? 
(a) I would take the job 
(b) I would turn it down if the govern- 
ment would help me go to school 
(c) I would turn it down and go back to 
school regardless 
8. If you could get no job at all, what would 
you do? 
(a) I would not go back to school 
(b) If the government would aid me, 1 
would go back to school 
(c) 1 would go back to school even with- 
out government aid 
4. If you could do what you like after the war 
is over, would you go back to school? 
(a) Yes 
(b) No 


Let us suppose the responses of the men to 
these questions form a scale in the manner 
shown in Figure 5. 


namely, the scale variable. Notice, however, that 
the distribution of the scale variable according to 
the rank order is not at all normal. One of the 
contributions of scaling theory is to do away with 
untested and unnecessary hypotheses about. normal 
distributions. It is the point correlation that is 
involved in the mathematical analysis of scaling, 
not the tetrachoric. 
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Good i would turn 
job Would take a good job (70%) aa Eod 
job (20%) 
Some 
kind Would take some kind of job (60%) would turn down some kind of job 
of job some kind of job (20%) 
(20%) 
No If government aided, would Would go back to 
job Would nat pd be schiach go back to school school even without 
at all (35%) (40%) aid (25%) 
! Hi 1 T T 
i 
voro fer eel il lag 
i I l i isa 
Like 
to go No, would not like to go to school (50%) Yes, would like to go to school (50%) 
to school 
Frequency 35% 15% 10% 10% 5% 5% 10% 10% 
Score 0 1 2 3 Aon 6 7 
Figure 5 


We now know how to read such a chart. Ten 
percent of the men said they would turn down 
a good job to go back to school; 20 percent said 
they would turn down a good job only if the 
government aided them; 70 percent said they 
would take a good job; and so on. The 10 per- 
cent who said they would turn down a good 
job are included in the 20 percent who said 
they would turn down some kind of a job, and 
the 20 percent are included in the 25 percent 
who said they would go back to school if they 
got no job at all, and these 25 percent are in- 
cluded in the 50 percent who said they would 
like to go back to school. 

For three trichotomous and one dichotomous 
questions there are 3 X 3 x 3 X 2= 54 possible 
types. In order for these to form a scale, it can 
be shown that at most eight types can occur. 
The chart shows the eight types, which have 
been scored from 0 through 7. The chart shows 
the characteristic of each type. For example, 
the type with the score 3 includes all men with 
the following four values: they say that they 
would take a good job if it were offered to them 
rather than go back to school; that they would 
turn down some kind of job if the government 
would aid them to go back to school; that they 
would go back to school if the government 


would aid them if they could get no job at all; 
and that they would like to go back to school. 
Thus, by reading the categories crossed by the 
dashed lines which enclose each type, we can 
read off the characteristics of the type. 

Notice that each of the four attributes is a 
simple function of the scale scores. For example, 
the "good job" question has its categories cor- 
respond with the following three intervals of 
the scale scores: 0-3, 4-6, 7. 

The question might be raised as to how often 
will scales be found in practice. Isn't even a 
fair approximation to a structure like that in 
the above chart too much to hope for to be 
found in real life? Towards an answer to this, 
we can only cite thus far our experience with 
research in the Army. Literally dozens of suffi- 
ciently perfect scales have been found in var- 
ious areas of attitude, opinion, and knowledge. 
The example given above of desire to go to 
school is a fictitious version of a set of similar 
questions that have actually proved scalable for 
the Army. Many varieties of data have been 
found scalable, and many have not. Those data 
which proved scalable could then be related to 
other variables very easily. Those that were not 
scalable required a more complicated analysis 
to handle them properly. 
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12. ON SAMPLING THE UNIVERSE OF 
ATTRIBUTES 


An important property of a scalable universe 
is that the ordering of persons based on a sam- 
ple of items will be essentially that based on 
the universe. If the universe is a scale, what the 
addition of further items would do would be 
merely to break up each type given by the sam- 
ple into more differentiated types. But it would 
not interchange the order of the types already 
in the sample. For example, in Figure 5 above, 
type 6 would always have a higher rank order 
than type 5. People in type 6 might be ordered 
within the type into more subcategories; people 
within type 5 might be ordered into more sub- 
categories; but all subcategories within 6 would 
remain of higher rank than all those in type 5. 
This may be seen in reverse, for example, by 
deleting one of the questions and noticing that 
all that is accomplished is to collapse the num- 
ber of types to a smaller number so that two 
neighboring types may now become indistin- 
guishable; but any types two steps apart would 
still remain in the same order with respect to 
each other. 

Hence, we are assured that if a person ranks 
higher than another person in a sample of 
items, he will rank higher in the universe of 
items. This is an important property of scales, 
that from a sample of attributes we can draw 
inferences about the universe of attributes. 

One of the criteria for selecting a sample of 
items is to choose a sample with enough cate- 
gories to provide a desired amount of differen- 
tiation between individuals. Thus if individuals 
are desired to be differentiated say only into 10 
groups, items should be chosen which will yield 
10 types? The shape of the distribution of the 
rank orders in a sample of attributes will of 
course depend upon the sample. One sample of 
attributes may give one shape distribution; an- 
other sample may give another shape distribu- 
tion. This need not be a matter of concern, 
since our primary interest lies in the ordering 
of people, not the relative frequency of each 
position, 

It might be asked how can one know the uni- 
verse forms a scale if all one knows is a sample 
from the universe. At present it seems quite 
clear that in general the probability of finding 


9 We are of course not considering problems of 
reliability in the sense of repeated observations of 
the same attributes. For convenience, we are tac- 
itly assuming perfect reliability. 


a sample of attributes to form a scale by chance 
for a sample of individuals is quite negligible, 
even if there are as few as three dichotomous 
items in the sample and as many as one hun- 
dred individuals.1° It seems quite safe to infer 
in general that if a sample of attributes is se- 
lected without knowledge of their empirical in- 
terrelationships and is found to form a scale for 
any sizeable random sample of individuals, then 
the universe from which the attributes are se- 
lected is scalable for the entire population of 
individuals. 


13. SCALING AND PREDICTION 


It is important to distinguish between two 
closely related topics, scaling and prediction. 
Finding that a universe of attributes is scalable 
for a population means that it is possible to 
derive a quantitative variable from the multi- 
variate distribution such that each attribute is 
a simple function of that variable. We might 
phrase this otherwise by saying that each attri- 
bute is (perfectly) predictable from the quanti- 
tative variable. 

This is the converse of the ordinary problem 
of prediction. In an ordinary problem of pre- 
diction, there is an outside variable, indepen- 
dently defined, that is to be predicted from the 
attributes. For example, it might be desired to 
predict the income of a student five years after 
he graduates from college, from his present 
knowledge of mathematics, To do this, an ex- 
perimental sample would have to be obtained 
where salaries five years after college are known 
for each person and where responses to each 
item on the mathematics test are known. If the 
criterion of least-squares is adopted, then the 
best prediction on the basis of the sample 
would be the multiple regression of income on 


10 Technical Footnote. To work out the complete 
probability theory would require two things: first, 
a definition of a sampling process for selecting 
items, and second, a definition of what is meant 
by a scale not existing. A definition of the sampling 
process is difficult because items are ordinarily 
developed intuitively. Stating a null hypothe- 
sis that a scale does not exist leads to many 
possible analytical formulations, for different lim- 
iting conditions may be imposed upon the multi- 
variate distribution of the items. For example, 
should the marginal frequencies be considered fixed 
in all samples, should the bivariate frequencies be 
considered fixed, etc? These are questions which 
may become clearer as the theory of scaling devel- 
ops, and in return may clarify our conceptions of 
what observation of social phenomena implies. 
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the three items in the sample. The multivariate 
distribution of the three items and the outside 
variable would give the necessary data for com- 
puting the regression, curvilinear or linear, 
which would be best for predicting the outside 
variable. If we wished to predict some other 
outside variable from the same items, a new 
multiple regression would have to be worked 
out from the multivariate distribution of the 
three items and the new outside variable. In 
general, the first of these regressions would ordi- 
narily be expected to differ from the second. In 
general, weights to be used to predict one out- 
side variable from a set of attributes will differ 
from those used to predict another outside vari- 
able; a new multiple regression must be worked 
out for each outside variable. 

This emphasizes an important property of 
scales. If the items have a multivariate distribu- 
tion that is scalable, it can easily be seen that 
no matter what the outside variable may be, 
the same prediction weights may be given to the 
items. The correlation of any outside variable 
with the scale scores is precisely the same as the 
multiple correlation of that outside variable 
with the items in the scale. Thus we have an 
outstanding property of scaling, namely, that it 
provides an invariant quantification of the at- 
tributes for predicting any outside variable. No 
matter what prediction purpose is to be served 
by the attributes, the scale scores will serve that 
purpose, 


14. ON “ITEM ANALYSIS” 


Let us repeat the distinction just made. In 
scaling we reproduce the attributes from a quan- 
titative variable. In prediction, we predict a 
variable from the attributes. This is a sharp dif- 
ference which enables us to avoid much of the 
confusion that seems to prevail in the previous 
literature on scale construction. It seems to have 
been felt that items in a universe are merely 
stepping stones from which to obtain scores. It 
seems to have been felt that it was an embar- 
rassing deficiency to lack a particular variable 
to predict from the items—that as a necessary 
evil one had to resort to methods of internal 
consistency to derive scores. 

This accounts for current “item analysis” ap- 
proaches to scaling. These use procedures that 
are typically as follows. A trial set of weights is 
assigned the categories, yielding a trial set of 
scores, Then each item is examined to see how 
well it by itself discriminates between these 


scores, that is, how well the scores can be pre- 
dicted from the item. Those items which indi- 
vidually discriminate best are retained, and the 
others eliminated. 

The misleading character of such procedures 
can be seen by inspection of the examples of 
scales in §10 and §11 above. We have pointed 
out that the intercorrelations between attri- 
butes in a scale can be as close to zero as one 
pleases. It can also easily be seen that the cor- 
relation ratio of the scale scores with any single 
item can also be as close to zero as one pleases. 
The predictability of the scale variable from an 
attribute does not tell whether or not the attri- 
bute is predictable from the scale variable. 

The use of the “item analysis” procedures in 
connection with scales seems to be an unfortu- 
nate carry-over from the problem of ordinary 
prediction of an outside variable. In such a pre- 
diction problem, the items are truly but step- 
ping stones to enable predictions to be made. 
It is knownH that item analysis affords a first 
approximation to multiple correlation (or the 
discriminant function), and an item is of inter- 
est only insofar as it aids in the multiple 
regression. 

Our emphasis for scaling is quite different. In 
scaling, we are interested in each and every at- 
tribute in the universe on its own merits. If we 
were not, we would not work with the universe. 
The attributes are the important things; and if 
they are scalable, then the scores are merely a 
compact framework with which to represent 
them. 

If a compact framework is found, it has the 
additional important property of being an effi- 
cient device for predicting any outside variable 
in the best manner possible from the given uni- 
verse of attributes. 


15. THE RELATIVITY OF SCALES 


An interesting problem associated with scales 
is: why does a universe form a scale for a given 
population? For example, take the sample of 
three mathematics questions given above. Why 
should these three questions be scalable? There 
is no necessary logical reason why a person must 
know the area of a circle before he can know 
what a derivative is, and in particular the de- 
rivative of e”. The reason for a scale emerging 


11 See, for example, Louis Guttman, "An Outline 
of the Statistical Theory of Prediction," in Paul 
Horst et al, The Prediction of Personal Adjust- 
ment, Social Science Research Council, 1941. 
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in this case seems largely cultural. Our educa- 
tional system is such that the sequence with 
which we learn our mathematics in our high 
schools and colleges is first to get things such as 
areas of circles, then algebra, and then calculus. 
And the amount of drill that we have on each 
of these topics is probably also in that order. It 
would be quite possible, however, for the pro- 
verbial "man from Mars" to come to this earth 
and study calculus without having to learn the 
area of a circle, so that he might not be a scale 
type, according to the scale presented above; or 
a student may have had some personal incident 
which somehow impressed upon him with great 
force the derivative of e7, but in the ordinary 
course of circumstances would have forgotten it 
even more readily than he forgot the area of a 
circle. 

The scale analysis will pick out such deviants 
or non-scale types. Of course, if these non-scale 
types are too numerous, we shall not say that a 
scale exists. In practice we find scales, although 
never perfect scales, only because there has been 
sufficient uniformity of experience for the pop- 
ulation of individuals so that the attributes 
mean essentially the same thing to the different 
individuals. As a matter of fact a study of the 
deviants is an interesting by-product of the scale 
analysis. Scale analysis actually picks out indi- 
viduals for case studies. 

A universe may form a scale for a population 
at a given time and may not at a later time. 
For example, the items in the scale of expres- 
sion of desire of American soldiers to go back 
to school after the war may not prove to be 
scalable if they were asked once more at the 
close of the war. 

A universe may form a scale for one popula- 
tion of individuals, but not for another. Or the 
attributes may form scales for two populations 
in different manners. For example, a sample of 
items of satisfaction with Army life which 
formed a scale for combat outfits in the Air 
Force did not form a scale for men in the tech- 
nical schools of the Air Force. The structure of 
camp life for these two groups was too different 
for the same items to have the same meaning 
in both situations. 

If a universe is scalable for one population 
but not for another population, or forms a 
scale in a different manner, we cannot compare 
the two populations in degree and say that one 
is higher or lower on the average than another 
with respect to the universe. They differ in 


more than one dimension, or in kind rather 
than in degree. It is only if two groups or two 
individuals fall into the same scale that they can 
be ordered from higher to lower. A similar con- 
sideration holds for comparisons in time. An 
important contribution of the present theory of 
scaling is to bring out this emphasis quite 
sharply. 


16. SUMMARY 


1. The multivariate frequency distribution of 
a universe of attributes for a population of ob- 
jects is a scale if it is possible to derive from 
the distribution a quantitative variable with 
which to characterize the objects such that each 
attribute is a simple function of that quantita- 
tive variable. 

2. There is an unambiguous meaning to the 
order of scale scores. An object with a higher 
score than another object is characterized by 
higher, or at least equivalent, values on each 
attribute. 

3. There is an unambiguous meaning to the 
order of attribute values. One category of an 
attribute is higher than another if it character- 
izes objects higher on the scale. 

4. It can be shown that if the data are scal- 
able, the orderings of objects and of categories 
are in general unique (except for direction). 
Both orderings emerge from analysis of the 
data, rather than from a priori considerations. 

5. The predictability of any outside variable 
from the scale scores is the same as the predict- 
ability from the multivariate distribution with 
the attributes. The zero order correlation with 
the scale score is equivalent to the multiple cor- 
relation with the universe. Hence, scale scores 
provide an invariant quantification of the attri- 
butes for predicting any outside variable what- 
soever. 

6. Scales are relative to time and to popula- 
tions, 

(a) For a given population of objects, a 
universe may be scalable at one time 
but not at another, or it may be scal- 
able at two periods of time but with 
different orderings of objects and cat- 
egories. 

(b) A universe may be scalable for one 
population but not for another, or it 
may be scalable for two populations 
but with different orderings of objects 
and categories. 
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(c) Comparisons with respect to degree 
can be made only if the same scaling 
obtains in both cases being compared. 

7. From the multivariate distribution of a 
sample of attributes for a sample of objects, in- 
ferences can be drawn concerning the complete 
distribution of the universe for the population. 

(a) The hypothesis that the complete dis- 
tribution is scalable can be adequately 
tested with a sample distribution. 

(b) The rank order among objects ac- 
cording to a sample scale is essentially 
that in the complete scale. 

(c) The ordering of categories in a sam- 


ple scale is essentially that in the 
complete scale. 
8. Perfect scales are not found in practice. 

(a) The degree of approximation to per- 
fection is measured by a coefficient of 
reproducibility, which is the empir- 
ical relative frequency with which 
values of the attributes do correspond 
to intervals of a scale variable. 

(b) In practice, 85 percent perfect scales 
or better have been used as efficient 
approximations to perfect scales. 

9. In imperfect scales, scale analysis picks out 
deviants or non-scale types for case studies. 


13. Cross-Cultural Comparability in Attitude Measurement 
via Multilingual Semantic Differentials 


CHARLES E. OSGOOD 


The world is rapidly shrinking—politically, so- 
cially, and psychologically. Recent developments 
in the technology of transportation and commu- 
nication are annihilating both space and time. 
These same developments are making it possi- 
ble to conduct social and behavioral science 
research on an international scale that is cer- 
tainly rewarding scientifically and perhaps es- 
sential practically, if we are to survive along 
with our technology. Indeed, there are many 
hypotheses about human nature that demand 
cross-national designs, if we are to successfully 
disentangle what is common to the human spe- 
cies from what is specific to a particular lan- 
guage or culture. 

But comparisons across cultures are extremely 
difficult for what anthropologists call nonmate- 
rial traits—things such as values, customs, atti- 
tudes, feelings, and meanings. Many years ago 
Edward Sapir and Benjamin Lee Whorf 
phrased what would now be called the hypothe- 
sis of psycholinguistic relativity. According to 
this hypothesis, how we perceive, how we think 
and even how we formulate our basic philoso- 
phies depend upon the structure of the lan- 
guage we speak. If this were literally and com- 
pletely true, then comparisons across language 
barriers would be impossible. The Sapir-Whorf 
hypothesis has been shown to apply to cer- 
tain aspects of language—for example, the way 
in which the lexicon of any language arbitrarily 


* Reprinted from I. Steiner and M. Fishbein (Eds.), 
Current Studies in Social Psychology, Holt, Rine- 
hart and Winston, New York, 1965, with permis- 
sion of the author and the publisher. This article 
is a slightly abridged version of a paper read 
before the International Congress of Psychology, 
August 23, 1963. Professors William K. Archer, 
ethnolinguist, and Murray $. Miron, psycholin- 
guist, were coinvestigators in the research, which 
was supported by the Human Ecology Fund. 


carves up the world denotatively does influ- 
ence the perceptual and conceptual processes 
of its users. But, our research is making it clear 
that there are other aspects of language—par- 
ticularly the way it represents affect and the 
way affect mediates metaphor and symbolism 
—for which universality rather than relativity 
seems to be the rule. 

Let me begin by asking you to do the im- 
possible—to imagine a space of some unknown 
number of dimensions. This will be our hypo- 
thetical semantic space. Just like all self-respect- 
ing spaces, this one has an origin, which we 
define as complete "meaninglessness"—this is 
like the neutral grey center of the color space. 
If we think of the meaning of any word or 
concept as being some particular point in this 
space, then we could represent it by a vector 
out from the origin to that point (for example, 
x and y in Figure 1). The longer the vector, 
the further out in semantic space (concept x), 
the more "meaningful" the concept; the shorter 
the vector, the nearer the origin of the space 
(concept y), the less intensely meaningful the 
concept—this being analogous to saturation in 
the color space. Vectors may also vary in their 
direction within this n-dimensional space, and 
we equate direction with the "quality" of mean- 
ing—like the way colors may vary in hue, in- 
cluding the white to black axis. It should also 
be noted that if we are dealing with an Eu- 
clidean space—which is the simplest assump- 
tion about Nature to start with—then the less 
the distance between the endpoints of vectors 
in our semantic space, the more similar in 
meaning should be the concepts they represent. 

One more analogy with the color space will 
prove useful to us: Just as complementary col- 
ors are defined as points equidistant and in 
opposite directions from the origin of the color 
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space, which when mixed together in equal 
proportions cancel each other out to neutral 
grey, so we may think of verbal opposites as 
defining straight lines through the origin of the 
semantic space and cancelling each other out 
to meaningless when mixed. As a matter of 
fact, this is exactly the way dictionary-makers 
define pure or logical opposites in language— 
their meanings cancel each other out, com- 
ponent for component. 

Now imagine a whole set of different straight- 
line "cuts" through this space, as suggested by 
the fine lines in Figure 1. Each would be de- 
fined by a different pair of opposites—hard- 
soft, excitable-calm, good-bad, fair-unfair, hot- 
cold, noisy-quiet, large-small, and so forth— 
creating a veritable pincushion of qualitative 
dimensions. In order to locate a particular 
person's meaning of a particular concept, say 
"white rose buds," we might now play a spe- 
cial game of “Twenty Questions" with him: 
(1) Is this concept beautiful or ugly? It is 
beautiful—so it must be in the upper half of 
the total space. (2) Is it hard or soft? It is soft 
—so it must be upward and to the right. (3) Is 
it noisy or quiet? It is quiet—so it must be in 
the octants away from us rather than near us 
in Figure 1. Thus, with only three binary ques- 
tions, we could decide in which of eight octants 
of the space was "white rose buds"; or, if each 
straight-line "cut" were scaled into seven dis- 


criminable steps, for example, from extremely 
beautiful through neutral to extremely ugly, 
as we have actually done in our work, then 
each decision would reduce uncertainty by six- 
sevenths, and only three “cuts” would differen- 
tiate a space having 343 discrete regions. 

But to talk about “directions” in any space 
one has to have some reference coordinates. 
Is the up-down, north-south, east-west of the 
semantic space to be completely arbitrary, or 
is there some "natural" built-in structuring, 
analogous to the gravitational and magnetic 
determinants of geophysical space? This ques- 
tion is an empirical one, and the logical tool is 
some variant of factor analysis. We need to 
take large and representative samples of scales 
defined by verbal opposites, determine their 
patterns of correlation when used in judging 
varied samples of concepts by large and varied 
samples of subjects, and see if in fact they do 
fall into "natural" clusters or factors. We will 
want to determine if these factors are orthogo- 
nal, at right angles to each other, and if they 
are stable, that is, repeatable over independent 
samples of people, concepts and scales. 

When a group of people judge a set of con- 
cepts against a set of semantic scales, a cube 
of data is generated, as shown in Figure 2. The 
rows are defined by scales, the columns by the 
concepts being judged, and the “slices” from 
front to back by the subjects doing the judg- 


Figure 1. Hypothetical semantic space. 
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Concepts 


Figure 2. A cube of data. 


ing. Each cell represents with a single value 
how a particular subject rated a particular con- 
cept against a particular scale. We are usually, 
but not necessarily, interested in the correla- 
tions among the scales; we may obtain these 
correlations across subjects or across scales or 
across both simultaneously. We may do sepa- 
rate analyses for single subjects or single con- 
cepts, or we may collapse either the subject 
dimension (if we are interested in cultural 
meanings) or the concept dimension (if we are 
interested in concept class characteristics). In 
other words, there are many ways we can slice 
this semantic cake, and each is appropriate for 
answering a different kind of question. 
During the past decade or more we have 
analysed many such data cubes collected from 
English-speaking Americans. Much of this 
work has been summarized by Osgood, Suci, 
and Tannenbaum (1957). Despite deliberate 
and independent variations in the rules for 
sampling scales and concepts and in the kinds 
of subjects used, three dominant, orthogonal 
factors have kept reappearing: An “evaluative” 
factor (represented by scales like good-bad, 
kind-cruel, and honest-dishonest), a “potency” 
factor (represented by scales like strong-weak, 
hard-soft and heavy-light) and an "activity" 
factor (represented by scales likes active-passive, 
fast-slow and hot-cold). What this means is that 


there are at least three "directions" in the 
semantic space that are regions of relatively 
high density, in the sense of containing many 
highly correlated scales representing similar 
modes of qualifying. Evaluation, potency and 
activity appear to be the most salient modes 
of qualifying experience. Of course, there are 
additional factors, other dimensions of the 
semantic space—indeed, a long train of them 
of decreasing importance and increasing 
uniqueness. 

Is this basic evaluation-potency-activity frame- 
work common to all people? Within the 
English-speaking American community we have 
made many comparisons—between old people 
and young, between males and females, be- 
tween Eisenhower Republicans and Stevenson 
Democrats, and even between schizophrenics 
and normals. The results of all these and many 
more comparisons can be stated quite suc 
cinctly: In no case has there been convincing 
evidence for differences in the underlying fac- 
tors, But is it possible that this semantic sys 
tem is restricted to Americans who speak the 
English language? Let me now tell you some- 
thing about our attempt to determine if this 
affective meaning system varies with language 
and culture or is, indeed, panhuman. 

In the course of our earlier work we had 
already made a number of cross-cultural com- 
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parisons—involving Japanese, Koreans, Greeks, 
and Navajo, Zuni, and Hopi Indians in the 
American Southwest—and the similarities in 
factor structure were striking. But, for the most 
part, these studies involved simply translating 
American English scales into the various lan- 
guages under investigation, and we were open 
to the criticism that we were forcing people 
of other language/culture communities to op- 
erate within the limits imposed by our own 
semantic system. During the past four years 
we have been trying to test the generality of 
the affective semantic system under the most 
rigorous conditions we could devise. With the 
cooperation of senior social scientists and their 
associates in some fifteen countries around the 
world and a staff at the University of Illinois, 
we have been collecting basic semantic data 
cubes "from scratch," so to speak, testing the 
generality of the factor systems, and, on the 
basis of the generality revealed, constructing 
comparable semantic measuring instruments. 

We start with a list of 100 familiar concepts 
that have been selected by linguists and anthro- 
pologists as being "culture fair" and that have 
survived a stringent translation test with bi- 
linguals in all of the six language families 
represented in our sample. (We have been 
working with Japanese, Cantonese in Hong 
Kong, Kannada in Southern India, Hindi in 
Northern India, Farsi in Afghanistan and Iran, 
Arabic in Lebanon, Turkish, Finnish, Serbo- 
Croatian in Yugoslavia, Polish, and a number 
of Western Indo-European languages: Dutch, 
Flemish, French, Swedish, and English.) "This 
original list of 100 translation-equivalent con- 
cepts, like "house," "man," "sky," "hand," 
"future," "dog," and "anger," is the only point 
at which translation is involved and could in- 
fluence the results. From this point on, every- 
thing is done in the native language, with 
native monolingual subjects and with indige- 
nous research personnel. 

The first step is to have 100 young high- 
school-level boys in each country give the first 
qualifiers (adjectives, in English) that occur to 
them when each of the 100 concepts is given 
as a stimulus—for example, to the word for 
"tree" in his language one boy might say tall, 
another green, another big, and so forth. This 
basketful of 10,000 qualifiers (100 subjects times 
100 concepts) is shipped to Illinois, where, us- 
ing IBM and Illiac high-speed computers, we 
determine a rank order of these various ways 
of qualifying experience in terms of their total 


frequency of usage, diversity of usage across 
the 100 nouns and independence of usage 
with respect to each other—in other words, we 
are looking for the most characteristic, produc- 
tive and independent qualitative dimensions 
in each language. Even at this level, similari- 
ties begin to emerge. Not only are the rank- 
frequency distributions of qualifiers statistically 
similar, but when the ranked qualifiers in vari- 
ous languages are translated into English and 
correlated, the correlations are all in the .50 to 
-70 range and highly significant. In other words, 
despite the difficulty of “mapping” one lan- 
guage into another in translation, the dominant 
modes of qualifying experiences, of describing 
aspects of objects and events, tend to be very 
similar, regardless of what language one uses 
or what culture he happens to have grown up 
in. 

The 60 or 70 highest ranked qualifiers are 
shipped back to the field, where their oppo- 
sites are elicited and they are made into bi- 
polar scales—the highest ranked 50 surviving 
this process being kept. Then another group 
of young males rate the original 100 concepts 
against these 50 scales, 200 subjects being di- 
vided into subgroups of 20 judging subsets of 
10 concepts because this task is so time-con- 
suming. Thus we generate a cube of semantic 
data, 100 concepts times 50 scales times 20 sub- 
jects, in each language/culture community. 
These data are returned to Illinois, where stan- 
dard correlation, factor analysis and rotation 
procedures are applied by our computers. Such 
analyses of each cube of data yield a unique 
solution for each language/culture community; 
nevertheless, it is gratifying to be able to re- 
port that for the 12 communities carried 
through this stage so far, evaluation, potency 
and activity are identifiable as the first three 
factors in magnitude—and usually in that 
order. s 

The most crucial test of factorial similarity 
is to put the data from all countries into the 
same mathematical space. Since we have neither 
identical subjects nor identical scales, we must 
do this by assuming that our 100 translation- 
equivalent concepts are identical, and by corre- 
lating across them. To the extent that these 
100 concepts are not identical in meaning 
across our language/culture communities—and 
we can be sure that they are not identical— 
then all this can do is introduce “noise” into 
our system and reduce our correlations, that 
is, work against us. In the first such test that 


112 ATTITUDE MEASUREMENT 


we ran, we selected the three highest-loading 
scales on the first three factors of the unique 
analysis of seven countries (63 scales in all), 
intercorrelated them across 100 common con- 
cepts, and then factored the correlation matrix 
by usual procedures. The results clearly indi- 
cated that evaluation, potency, and activity 
factors are truly pancultural. The three domi- 
nant factors do not represent particular lan- 
guage/culture communities, but rather common 
semantic dimensions. 
What is the purpose of all this busy work 
.in many lands and many tongues? The first, 
purely scientific purpose has been to demon- 
strate that human beings the world over, no 
matter what their language or culture, do share 
a common meaning system, do organize experi- 
ences along similar symbolic dimensions. In 
contradiction to Benjamin Lee Whorf's notion 
of "psycholinguistic relativity," here is at least 
one aspect of language behavior that is uni- 
versal. A second, more practical purpose of this 
research is to develop and apply instruments 
for measuring "subjective culture"—meanings, 
attitudes, values, customs and the like—instru- 
ments that can be shown to be comparable 
across languages and thereby break through the 
language barrier. The demonstration of com- 
mon semantic factors makes it completely 
feasible to construct efficient "semantic differen- 
tials” for measuring the meanings of critical 
concepts cross-culturally, with reasonable con- 
fidence that the yardstick being employed is 
something better than a rubber band. It is now 
possible to ask questions like: "How does the 
meaning of the self vary from culture to cul- 
ture?” "How do attitudes toward leadership 
and authority vary around the world?" "Is a 
common subjective culture developing among 
the world's elites?"—and be reasonably sure 
that the answers are not artifacts of translation. 
Now, given this long introduction, let me 
turn directly to the matter of measuring atti- 
tudes cross-culturally. Despite the plethora of 
definitions of “atitude,” there seems to be gen- 
eral consensus that (a) they are learned and im- 
plicit, (b) that they may be evoked either by 
perceptual signs or linguistic signs (c) that they 
are predispositions to respond evaluatively to 
these signs, and (d) that the evaluative predis- 
position may fall anywhere along a scale from 
“extremely favorable” through “neutral” to 
“extremely unfavorable.” 
In all of the general factor analyses of affec- 
tive meanings we have done to date, the first 


and dominant factor—usually accounting fot 
about twice as much variance as any other— 
has always been clearly identifiable as Evalua- 
tion. Furthermore, being a bipolar factor, 
graded in intensity in both directions from a 
neutral point, it meets the criterion of reflect- 
ing predispositions from “extremely favorable” 
through “neutral” to “extremely unfavorable.” 
It is also clear that, in the general mediation 
theory of meaning of which our semantic dif- 
ferential measuring operations are part, mean- 
ings of concepts are implicit reactions to either 
perceptual or linguistic signs, and they are 
learned. The evaluative factor of semantic 
differentials thus seems to meet all the criteria 
for a measure of attitude, and it is therefore 
tempting to simply define “attitude” toward 
any concept as its projection onto the evalu- 
ative factor in the total meaning space. 

We do have some empirical justification for 
this identification. My colleague during the 
early development of semantic differential tech- 
nique, Dr. Percy Tannenbaum, compared E- 
factor scores for subjects judging the concepts 
“the Negro,” “the church,” and “capital pun- 
ishment” with their scores on special 'Thurstone 
attitude scales devised to tap each of these atti- 
tudes. Reliabilities of the E-factor scores proved 
to be slightly higher, actually, than those of 
the Thurstone scales. More important, the E- 
factor scores for the concepts correlated with 
their corresponding Thurstone scale scores as 
highly as reliabilities could allow. In other 
words, it would appear that whatever the spe- 
cific "Thurstone scales were measuring—and 
they were designed to measure attitude—the 
E factor was measuring equally well. 

But if this is the case, then the E factor of 
the semantic differential has an obvious advan- 
tage: it applies exactly the same set of evalu- 
ative scales to all three concepts, "Negro," 
"church" and "capital punishment," whereas 
in the Thurstone technique unique statements 
must be selected and scaled for each concept 
whose attitude is being assessed. The E factor 
of the semantic differential thus has the advan- 
tage of being potentially general across the 
concepts being studied. If such generality is an 
advantage within a single language/culture 
community it becomes absolutely essential if 
we are to measure attitudes comparably, across 
differences in both languages and cultures. One 
can imagine the difficulties there would be in 
trying to translate and show scale equivalence 
for statements like "I would not admit a Negro 
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to my social club,” or “I go to church every 
Sunday.” 

You will recall that the factors obtained 
from different language/culture communities 
were found to be truly pancultural in the 
direct, mathematical sense that the scales from 
these different language/culture communities 
were found to be highly correlated when used 
in rating the standard 100 concepts. We will 
now take the four scales for each community 
which load highest and most purely on the pan- 
cultural evaluative factor and use their com- 
posite value for each concept, transposed to 
a +3 to —3 scale, as an SD-attitude-score— 
that is, a semantic-differential-determined atti- 
tude score for each concept. The scales used 
in this analysis, both in the native language 
and in approximate English translation, are 
given in Table 1. You will note that although 


they are not—in many cases translation equiva- 
lents, they do maintain a common evaluative 
feeling-tone. 

Before presenting some illustrative results, 
I would like to offer a few words of caution 
about interpretation. First, these data were col- 
lected in the course of the “tool-making” phases 
of our work and hence the concepts were not 
selected for their relevance to attitude measure- 
ment. Second, due to the large number of judg- 
ments involved in the concept-on-scale task, 
subgroups of 20 subjects judged subgroups of 
10 concepts; therefore, the N in each case is 
only 20. In our work with American subjects 
we have found factor-score differences as small 
as half a scale unit to be significant at beyond 
the .05 level for N's of this magnitude, but to 
be on the safe side I would use a full scale unit 
as a significance criterion here. Third, there 


TABLE 1. Scales Contributing to SD-Attitude-Scores for 


Seven Language/Culture Communities 


Native Language English Translation 

American English nice-awful 
sweet-sour 
good-bad 
happy-sad 

Dutch prettig-naar pleasant-unpleasant 
gezellig-ongezellig cozy-cheerless 
mooi-lelijk pretty-ugly 
gelukkig-ongelukkig happy-sad 

Belgian Flemish aangenaam-onaangenaam _agreeable-disagreeable 
plezierig-vervelend pleasant-boring 
gezellig-ongezellig cozy-cheerless 
prachtig-afschuwelijk magnificent-horrible 

French sympathique-antipathique _likeable-repugnant 
rassurant-effrayant calm-frightened 
gai-triste happy-sad 
gentil-méchant nice-awful 

Finnish hauska-ikava nice-awful 
valoisa-synkka light-gloomy 
makea-hapan sweet-sour 
onneton-onnellinen happy-sad 

Japanese pleasant-unpleasant 


gale «bu 


Indian Kannada tee: A 


mua - vr» 


comfortable-uncomfortable 
thankful-troublesome 
happy-sad 


merciful-cruel 


good-bad 
beautiful-ugly 
delicate-rough 
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TABLE 2. SD-Attitude Scores for Some “Attitude Objects" among 


Standard 100 Concepts 


Language/Culture Communities 


Ameri- Finn- Japa- Kan- 
Concepts can Dutch Flemish French ish nese nada 
Work 0.5 0.7 —0.2 0.9 0.3 1.0 1.0 
Wealth 1.3 0.5 1.2 0.9 —0.1 0.9 0.4 
Doctor 1.6 1.0 —0.7 0.8 0.3 1.0 1.0 
Luck 1.6 1.6 1.6 1.5 r2 2.5 1.0 
Peace 2.0 1.4 2.8 2. 1.1 2.3 0.8 
Policeman 0.8 —0.4 —90.8 —0.4 —0.5 0.0 1.0 


appear to be real differences in scale-checking 
style between language/culture groups; in the 
present case, the Flemish-speaking Belgians 
have SD-attitude-scores beyond plus or minus 
1.5 in polarization for 56 of the 100 concepts, 
whereas the Finns and the Kannada-speaking 
Indians have only 6 and 9 concepts respectively 
reaching this degree of polarization. It is also 
true that inconsistencies in attitudinal direction 
among the individual subjects in a group will 
tend to cancel the composite scores toward neu- 
trality, but since we are here interested in cul- 
tural meanings, not individual, this will not 
concern us. Finally, you should keep in mind 
the fact that the subjects are unmarried males 
of junior high-school age. 

With these caveats in mind, let us turn to 
some results. Table 2 reports SD-attitude-scores 
for some concepts among our sample of 100 
that are usually considered objects of attitude. 
Attitude toward "work" is most favorable for 
the Kannada-speaking Indians of Mysore and 
the Japanese, but actually slightly negative for 
the Flemish-speaking Belgians. The concept of 
“wealth” is quite positive in evaluation for 
Americans and Flemings but negative for 
Finns. Flemish and Finnish subjects have neu- 
tral or slightly negative attitudes toward 
“doctor,” in contrast to the very favorable 
American attitude. For some reason—unknown 
to me at least—the Japanese have an extremely 
positive feeling about "luck." "Peace" is fa- 
vored highly by everyone, and about equally 
if we take into account the differences in aver- 
age polarization; but only for Indians and 
Americans is “policeman” a positive concept. 

Table $ presents sets of concepts that are 
universally favorable, neutral, or unfavorable— 
insofar as our small sample of seven language/ 
culture communities can be considered an ade- 


quate sample of the universe! "Girl" "love," 
"marriage" and "mother" are highly favorable 
notions, as are “pleasure,” “freedom,” “success,” 
“sympathy,” and “friend”—all of which is not 
too surprising. The list of universally unfavor- 
able concepts reads like a catalogue of human 
misery: the emotional states of "pain," "anger," 
"guilt," and "fear"; the conditions of "danger," 
"punishment," and “crime”; the specific threats 
posed by "thief," “snake,” and "poison"; and 
then, at last, "battle," "defeat" and “death.” 
Yet, among these generally unfavorable no- 
tions, there are some interesting deviations: 
note the rather mild disapproval registered by 
the Indians (even taking into account their 
generally reduced polarization), as. compared 
with the intensely negative attitudes of the 
Japanese; note also the relative lack of concern 
of the French for things like "pain," "danger," 
and "death" as compared with the Flemish- 
speaking Belgians. 

It is interesting, but perhaps not surprising, 
that the evaluatively neutral concepts are all 
natural objects and phenomena, "Stone," 
“root,” "rope," "knot" and “cloud” surely 
are not objects about which people are likely 
to have strong attitudes, nor are the natural 
phenomena of "wind" and “heat” (unless they 
are in excess, in which case they usually have 
special names like "tornado"). But this is not 
necessarily the case. The first set of. concepts 
in Table 4 might have been expected to be 
attitudinally neutral, too, but they are not for 
all people in our sample. The linguistically 
and culturally close Dutch and Flemings have 
a highly favorable attitude toward “Wednes- 
day"; all Indo-European language communities 
(Americans, Dutch, Flemish, and French) place 
a high valuation on "chair," in contrast to the 
non-Indo-European groups (Finnish, Japanese 
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TABLE 3. Concepts Having Favorable, Neutral, and 
Unfavorable SD-Attitude-Scores Among Standard 100 Concepts 


Concepts Language/Culture Communities 

Ameri- Finn- Japa- Kan- 
Favorable can Dutch Flemish French ish nese nada 
Girl 2.1 2.0 2.8 1.9 1.4 1.1 1.1 
Love 2.2 2.1 2.2 2.0 1.8 1.3 1.4 
Marriage 2.8 1.9 2.0 2.0 1.7 1.4 1.1 
Mother 2.1 2.1 2.1 2.3 1.1 1.9 1.7 
Pleasure 1.8 1.9 2.8 2.2 1.3 2.0 1.1 
Friend 1.8 1.4 2.0 1.5 1.1 1.5 1.8 
Freedom 2.3 2.0 2.1 1.9 1.9 2.2 1.6 
Sympathy 1.3 1.3 2.1 21 1.4 0.8 1.0 
Music 1.6 1.7 1.9 1.8 1.2 14 1.9 
Sleep 1.8 L2 1.7 1.2 1.4 1.6 0.4 
Sun 1.6 1.8 14 2.1 1.2 1.7 0.9 
Success 1.5 2.1 2.1 1.7 0.5 2.2 1.0 
Neutral 
Wind 06 —03  —10 0.6 0.1 0.4 0.3 
Heat 0.4 0.5 0.3 0.8 0.0 0.3 0.4 
Stone 0.0 —0.6 0.2 02  —04 04 —08 
Cloud 1. 0.5 01  —05 0.2 0.5 0.7 
Root 0.6 00 —0.1 01 —0.4 0.5 0.9 
Rope 0.2 0.8 05 —01 —0.7 0.6 0.7 
Knot 0.2 0.0 0.0 0.8 0.0 0.0 0.5 
Unfavorable 
Pain ie 8s 10, 12 .—L9 —0.7 
Anger Lu 0504929918. 1.3) —1.5 —13 
Guilt 42) —13 —22. —13  —09  —21 —0.5 
Fear quao heap 9659.9 Ba) 46 15 —0.3 
Danger —16 —l8 —21 —09 —11 —20 —0.8 
Punishment —09 —11 —28 —16 —17 -—11 —1.1 
Crime jo =18  —19 —19, —15  —25 —0.6 
Thief arg 220022) 14 14 .—25 —1.3 
Snake 205-1 Sap —1.6 Rhee 04, ' —2.0 —0.5 
Poison epi et 62.8 0 23.6. 14 —22  —09 
Battle 219) —L4. —17 —17  —18 | —2] —0.5 
Defeat =i o A 14, 18 —0.4 
Death —09 —16 —20 —03 —13 —12 —0.9 
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and Kannada); the Kannada-speaking Indians 
have, for them, an extremely positive feeling 
toward the “moon,” while the Japanese feel 
similarly about "tree." Just why such affective 
investments should exist, I do not know, but I 
Suspect they are reflections of uniquenesses in 
Subjective culture and should be of interest to 
cultural anthropologists. 

Other comparisons in this table—which I 
have titled “Attitudinal Bits to Conjure With" 
—may have similar interest. The second set of 
concepts, all referring to human and kin clas- 


sifications, display close similarities among the 
Indo-European language groups, who, I believe, 
also share the same kinship system. We also 
note that attitudes toward "women" are higher 
than toward “men” everywhere except in Japan 
—however, keep in mind that our subjects 
were boys between 14 and 16 years of age. The 
set of future-oriented concepts, "hope," "fu- 
ture,” and "purpose," display an interesting 
pattern—they are all positively evaluated by 
Americans, French and Japanese, but are essen- 
tially neutral attitudinally for the Dutch, 
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TABLE 4. Attitudinal Bits to Conjure With 


_—__eee 


Language/Culture Community 


Ameri- Finn- Japa- Kan- 
Concept can Dutch Flemish French ish nese nada 
Wednesday 0.6 1.4 1.5 0.6 0.3 08 0.4 
Chair 1.2 1.4 1.6 1.4 0.3 09 08 
Moon 1.2 1.1 1.0 1.1 0.6 13 20 
Tree 0.8 0.9 1.1 14 0.1 20 08 
Husband 1.2 1.7 2.0 1.4 0.2 1.5 1.8 
Father 1.9 1.2 1.4 2.2 1.0 19 14 
Man 11 1.1 1.4 1.8 0.5 13 08 
Woman 1.9 1.9 1.8 24 1.1 Ido. dd 
Mother 2.1 2.1 2.1 2.8 1.1 A9 D 
Girl 2.1 2.0 23 1.9 1.4 1.1 1.1 
Hope 1.9 0.7 0.6 1.8 0.7 L9 0$ 
Future 14 0.7 0.5 1.2 0.3 1.1 0.5 
Purpose 1. 0.6 0.6 1.1 0.3 L0 05 
Hunger —l5 —18 —19 —03 —10 —12 05 
Need 0.5 04  —27 0.0 0.1 09 0.7 
Meat 1.6 0.7 1.5 07  —01 08 0.2 
Bread 1.6 1.0 1.5 14 0.4 14 0.7 
Fruit 1.5 1.4 2.2 1.5 0.7 17 15 
Food 1.8 1.6 1.4 1.5 0.6 L6 09 
Laughter 1.9 1.2 1.9 1.6 1.1 12;. 02 


Flemings, Finns, and Indians. Does this neces- 
sarily imply that Americans, French and Japa- 
nese are more hopeful about the future and 
more purposefully striving toward it? 

The next set of concepts suggests a para- 
doxical kind of denial mechanism: The Indians 
in Mysore rate "hunger" and "need" relatively 
positively, and yet (with the exception of 
"fruit") have relatively neutral attitudes toward 
the food concepts; on the other hand, the well- 
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B. Multidimensional Measurement Techniques 


14. Psychological Scaling without a Unit of Measurement 


CLYDE H. COOMBS 


I. INTRODUCTION 


The concept of measurement has generally 
meant the assignment of numbers to objects 
with the condition that these numbers obey 
the rules of arithmetic (1). This concept of 
measurement requires a ratio scale—one with 
a non-arbitrary origin of zero and a constant 
unit of measurement (3). The scales which are 
most widely made use of in psychology are 
regarded as interval scales in that the origin 
is recognized to be arbitrary and the unit of 
measurement is assumed to be constant. But 
this type of scale should be used only if it can 
be experimentally demonstrated by manipula- 
tion of the objects that the numbers assigned 
to the objects obey the laws of addition, The 
unit of measurement in psychology, however, 
is obtained by a combination of definitions and 
assumptions, which, if regarded as a first ap- 
proximation and associated with a statistical 
theory of error, serves many practical purposes. 
But because we may sometimes question the 


* Excerpted from Psychological Review, 1950, 57, 
145-158, with permission of the author and the 
American Psychological Association. This paper is 
a condensation of some of the ideas contained in 
a chapter of a general theory of psychological 
scaling developed in 1948-1949 under the auspices 
of the Rand Corporation and while in residence 
in the Department and the Laboratory of Social 
Relations, Harvard University. Development of the 
theory before and after the sojourn at Harvard 
was made possible by the support of the Bureau 
of Psychological Services, Institute for Human 
Adjustment, Horace H. Rackham School of Grad- 
uate Studies, University of Michigan. 


meaning of the definitions and the validity of 
the assumptions which lead to a unit of meas- 
urement, it is our intent in this paper to de- 
velop a new type of scale not involving a unit 
of measurement. This type of scale is an addi- 
tion to the types set up by S. S. Stevens (3). 
Stevens recognized ratio, interval, ordinal, and 
nominal scales. The type which we shall de- 
velop falls logically between an interval scale 
and an ordinal scale. We shall make no assump- 
tion of equality of intervals, or any other as- 
sumption which leads to a unit of measurement. 
We shall find, however, that on the basis of 
tolerable assumptions and with appropriate 
technique we are able to order the magnitude 
of the intervals between objects. We have called 
such a scale an "ordered metric" We shall 
develop the concepts first in an abstract manner 
with a hypothetical experiment and then illus- 
trate the ideas with an actual experiment. 
Under the limitations of a single paper we shall 
not present the psychological theory underly- 
ing some of the concepts and we shall place 
certain very limiting conditions on our hypo- 
thetical data in order to simplify the presen- 
tation. 


Il. THE PROBLEM 


When we set up an attitude scale by any of 
a variety of methods, for example the method 
of paired comparisons and the law of compara- 
tive judgment, we order statements of opinion 
on the attitude continuum and assign a number 
to each statement. We recognize in this in- 
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stance that the origin for the numbers is arbi- 
trary. We then follow one of several possible 
procedures (determining which statements an 
individual will indorse, for example) to locate 
the positions of individuals on this same con- 
tinuum. Because both individuals and stimuli 
have positions on this continuum we shall call 
it a joint distribution, joint continuum, or J 
scale. In general, with a psychological con- 
tinuum, we might expect that for one indi- 
vidual the statements of opinion, or stimuli, 
have different scale positions than for another 
individual. Thurstone (4) has provided the con- 
cept of stimulus dispersion to describe this 
variability of the scale positions on a psy- 
chological continuum. We have recently (2) 
discussed an equivalent concept for the vari- 
ability of scale positions which an individual 
may assume in responding to a group of 
stimuli. These two concepts have been basic 
to the development of a general theory of scal- 
ing to which this paper is an introduction. 

For didactic purposes we shall achieve brev- 
ity and simplicity for the presentation of the 
basic ideas underlying an ordered metric scale 
if we impose certain extreme limiting condi- 
tions on the variability of the positions of 
stimuli and individuals on the continuum. 
These conditions are that the dispersions of 
both stimuli and individuals be zero. In other 
words these conditions are that each stimulus 
has one and only one scale position for all indi- 
viduals and that each individual has one and 
only one scale position for all stimuli. For pur- 
poses of future generalization we shall classify 
these conditions as Class 1 conditions. Stimuli 
will be designated by the subscript j and the 
position of a stimulus will be designated the 
Q; value of the stimulus on the continuum. 
The position of an individual on the con- 
tinuum will be designated the C, value of an 
individual i. 

If we conceive of the attribute as being an 
attitude continuum, the C, value of an indi- 
vidual is the Q, value of that statement of 
opinion which perfectly represents the attitude 
of that individual. In this case the C, value of 
an individual is his ideal or norm. We shall 
assume that the degree to which a stimulus 
represents an individual's ideal value is depen- 
dent upon the nearness of the Q; value of the 
stimulus to the C, value of the individual. 

We shall then make the further assumption 
that if we ask an individual which of two 
statements he prefers to indorse he will indorse 


that statement the position of which is nearer 
to his own position on the continuum. 

Thus if asked to choose between two stimuli 
j and k, the individual will make the response 


j2k (1) 


|Q; — 6| < |Q — G| 


where j > k signifies the judgment “stimulus 
j preferred to stimulus k.” 

Under the extreme limiting conditions we 
have imposed on the C; and Q, values the 
method of paired comparisons would yield an 
internally consistent (transitive) set of judg- 
ments from each individual, though not neces- 
sarily the same for each, and each different set 
of such judgments could be represented by a 
unique rank order for the stimuli for that indi- 
vidual. We shall call the rank order of the 
stimuli for a particular individual a qualitative 
I scale, or, in general, an I scale, 

Thus if an individual placed four stimuli in 
the rank order A B C D as representing the 
descending order in which he would indorse 
them, then, this would be equivalent to the 
consistent set of judgments A > B, A > C, 
A > D, B > C, B > D, C > D; where the 
symbol “>” signifies “prefer to indorse,” as 
before. The order, A B C D, is the qualitative 
I scale of this individual. Hence for Class 1 
conditions it is sufficient to collect the data by 
the method of rank order; the greater power 
of the method of paired comparisons would be 
unnecessary and wasted. 

Let us assume now that we have asked each 
of a group of individuals to place a set of 
stimuli in rank order with respect to the rela- 
tive degree to which he would prefer to indorse 
them. Our understanding of the results that 
would follow will be clearer if we build a 
mechanical model which has the appropriate 
properties. This is very simply done by imagin- 
ing a hinge located on the J scale at the C; 
value of the individual and folding the left 
side of the J scale over and merging it with the 
right side. The stimuli on the two sides of the 
individual will mesh in such a way that the 
quantity | C, — Q,| will be in progressively 
ascending magnitude from left to right. The 
order of the stimuli on the folded J scale is 
the I scale for the individual whose C, value 
coincides with the hinge. 

It is immediately apparent that there will 
be classes of individuals whose I scales will be 
qualitatively identical as to order of the stimuli 


if 
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and that these classes will be bounded by the 
midpoints between pairs of stimuli on the J 
scale. For example, suppose that there are four 
stimuli, A B C D, whose Q; values or positions 
on a joint continuum are as shown in Fig. 1 
and that there is a distribution function of 
the positions of individuals on this same con- 
tinuum as indicated. 

If we take the individual whose position is 
at X in Figure 1, the I scale for that individual 
is obtained by folding the J scale at that point 
and we have the scale shown in Figure 2. 

The qualitative I scale for the individual at 
XisABC D. 

If we take the individual in position Y as 
shown in Figure 1 and construct his I scale, 
we have the scale shown in Figure 3. 

'The qualitative I scale for the individual at 
Y is C D B A. 

Consider all individuals to the left of posi- 
tion X on the J scale in Figure 1. The I scales 
of all such individuals will be quantitatively 
different for different positions to the left of X. 


A B 


For every one of them, however, the order of 
the stimuli on the I scale will be the same, 
A B C D. We shall regard these I scales as 
being qualitatively the same. As a matter of 
fact, the I scales of individuals to the right of 
X continue to be qualitatively the same until 
we reach the midpoint between stimuli A and 
B. For an individual immediately to the right 
of this midpoint the qualitative I scale will 
be B A C D. I scales immediately to the right 
of the midpoint AB will continue to be quali- 
tatively the same, B A C D, until we reach the 
midpoint between stimuli A and C. Immedi- 
ately past this midpoint the qualitative I scale 
is B C A D. Continuing beyond this point a 
complicating factor enters in which we shall 
discuss in a later section under metric effects. 

The distinction which has been made here 
between quantitative and qualitative I scales 
is of fundamental importance to the theory of 
psychological scaling. In almost all existing 
experimental methods in psychological scaling 
we do not measure the magnitudes | C, — Q; |. 


[n D 


Figure 1. A joint distribution of stimuli and individuals. 
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Figure 3. The I scale of an individual located at Y in the joint distribution. 


120 ATTITUDE MEASUREMENT 


but only observe their ordinal relations for a 
fixed C;.1 The kind of information which is 
obtained by the experimenter is essentially 
qualitative in nature. 

As we shall see, data in the form of I scales 
may tell us certain things: 


(1) whether there is a latent attribute under- 
lying the preferences or judgments; 

(2) the order of the stimuli on the joint 
continuum; 

(3) something about the relative magnitudes 
of the distances between stimuli; 

(4) the intervals which individuals are placed 
in and the order of the intervals on the 
continuum; and 

(5) something about the relative. magnitudes 
of these intervals. 


HI. A HYPOTHETICAL PROBLEM 


Let us now conduct a hypothetical experi- 
ment designed merely to illustrate the tech- 
nique. Of course this experiment, if actually 
conducted, would not turn out as we shall con- 
struct it, because we shall assume the extreme 
limiting conditions on Q and C values that 
were previously imposed. 

Let us imagine that we have a number of 
members of a political party and that we have 
four individuals who are potential presidential 
candidates. Let us ask each member of the party 
to place the four candidates, designated A, B, 
C, and D, in the rank order in which he would 
prefer them as President. With four stimuli the 
potential number of qualitatively different rank 
orders is 24—the number of permutations of 
four things taken four at a time. If there were 
no systematic forces at work among the party 
members we would get a distribution of occur- 
rences of these 24 I scales which could be fitted 
by a Poisson or Binomial distribution, every- 
thing could be attributed to chance, and the 
experiment would stop there. Instead, let us 
imagine for illustrative purposes a different 
result equally extreme in the opposite direc- 
tion. Let us imagine that from the N individ- 
uals doing the judging only seven qualitatively 


1 The ordinal relations of [C,— C,| may also be 
obtained experimentally for a fixed Q, over a set 
of C, We call such scales S scales by analogy with 
I scales. For sake of simplicity they are not treated 
here but actually the treatment for I scales and 
S scales is identical if the roles of stimuli and 
individuals are merely interchanged. 


different I scales were obtained and these were 
the following: 


L ABCD 
Lc BACD 
Tern AD 
LL SGBAD 
I; CBDA 
I “GD BA 
LU HORA 


The significance of the deviation of such 
results as these from pure chance would be 
self-evident. Consequently we would look at 
these seven I scales to see if there was some 
systematic latent attribute represented by a 
joint continuum such that individual differ- 
ences and stimulus differences on such a con- 
tinuum could account for these manifest data. 
Studying the set of seven I scales we observe 
that two of them are identical except in re- 
verse order, A B C D and D C B A. Further- 
more we see that in going from one to the next, 
two adjacent stimuli in the one have changed 
positions in the next. These are the charac- 
teristics of a set of I scales which have been 
generated from a single J scale. Seven I scales 
is the maximum number that one can obtain 
from a single quantitative J scale of four stim- 
uli under the conditions of Class 1. The sys- 
tematic latent attribute underlying this set of 
I scales is represented by the J scale which 
generates them. Our objective, then, is to re- 
cover this J scale and discover its properties 
or characteristics. 

To recover the J scale we proceed as follows. 
Every complete set of I scales has two and only 
two scales which are identical except in re- 
verse order. These are the I scales which arise 
from the first and last intervals of the J scale. 
Consequently these two I scales immediately 
define the ordinal relations of the stimuli on 
the J scale, in this case A B C D (the reverse 
order, of course, is equally acceptable). From 
the seven I scales we can order on the J scale 
the six midpoints between all possible pairs of 
stimuli. In going down the ordered list of I 
scales as previously determined, the pair of 
adjacent stimuli in one I scale which have 
changed places in the next I scale specify the 
midpoint on the J scale which has been passed. 

Thus in the first interval (Figure 4), we have 
all the individuals to the left of the midpoint 
between stimuli A and B. The second I scale 
is B A C D, and as stimuli A and B have 
changed places in going from the I, scale to 


PSYCHOLOGICAL SCALING WITHOUT UNIT OF MEASUREMENT 121 


A B 


AB AC BC AD BD CD 
l 1 


Figure 4. An example of how the midpoints of four stimuli may section the joint 
distribution into seven intervals, each characterized by an I scale. 


the I, scale we have passed the midpoint AB. 
In going from I, to Ig stimuli A and C ex- 
change orders on the two I scales and hence 
the midpoint between A and C is the boundary 
between I, and I. If we continue this process 
we see that the order of the six midpoints is 
as follows: AB, AC, BC, AD, BD, CD. These 
six boundaries section the joint distribution 
into seven intervals which are ordered as also 
are the stimuli. From the order of the six mid- 
points in the case of four stimuli we have one 
and only one piece of information about metric 
relations on the joint continuum. Because mid- 
point BC precedes AD we know that the dis- 
tance between stimuli C and D is greater than 
the distance between stimuli A and B. We 
shall discuss these points and characteristics in 
more detail in the section on metric effects. 
There are then an infinite variety of quanti- 
tatively different J scales which would yield 
this same set of seven I scales—but there is 
only one qualitative J scale. The J scale in 
Figure 4 meets the conditions necessary to yield 
the manifest data. 

It must be emphasized that all metric mag- 
nitudes in Figure 4 are arbitrary except that 
the distance from stimulus A to B is less than 
the distance from stimulus C to D. 

With the qualitative information obtained 
in this experiment about the latent attribute 
underlying the preferences for presidential 
candidates the next task is the identification of 
this attribute. Here all the experimenter can 
do is to ask himself what it is that these stim- 
uli have and the individuals have to these dif- 
ferent degrees as indicated by their ordinal 
and metric relations. One might find in this 
hypothetical case that it appears to be a con- 
tinuum of liberalism, for example, or of isola- 
tionism. One would then have to conduct an 
independent experiment with other criteria to 
validate one's interpretation. 


IV. METRIC EFFECTS 


While the data with which we deal in the 
vast majority of scaling experiments are quali- 


tative and non-numerical there are certain rela- 
tions between the manifest data and the metric 
relations of the continuum. These relations 
have not all been worked out and general ex- 
pressions have yet to be developed. The com- 
plexity of the relations very rapidly increases 
with the number of stimuli; therefore to illus- 
trate the effect of metric we shall take the 
simplest case in which its effect is made ap- 
parent, the case of four stimuli. 

With four stimuli, A, B, C, and D, there are 
24 permutations possible. Thus it is possible 
to find 24 qualitatively different I scales. Also, 
obviously, each of these 24 orders could occur 
as a J scale which could give rise to a set of 
I scales. Half of the J scales may be regarded 
as merely mirror images of the other half. ‘Thus 
if we have a J scale with the stimuli ordered 
B A D C and identify the continuum as liber- 
alism-conservativism, then, in principle, we also 
have the J scale with the stimuli ordered 
C D A B and would identify it as conservativism- 
liberalism. Hence there are only twelve J scales 
which may be regarded as qualitatively distin- 
guishable on the basis of the order of the four 
stimuli. I scales which are mirror images of 
each other, though, are definitely not to be 
confused. They may well represent entirely 
different psychological meanings. The direction 
of an I scale is defined experimentally—the 
direction of a J scale is a matter of choice. 

Each J scale of four stimuli gives rise to a 
set of seven qualitative I scales. We are inter- 
ested in knowing, of course, whether the J 
scale deduced from a set of I scales obtained 
in an experiment is qualitatively unique. The 
answer to this appears to be yes and immedi- 
ately obvious when it is recognized that a set 
of I scales generated from a J scale has two 
and only two I scales which are mirror images 
of each other; that these two I scales must 
have been generated from the intervals on the 
opposite ends of the J scale, and that the order 
of the I scales within a set is unique. These 
statements are still to be developed as formal 
mathematical proofs and hence must be re- 
garded as tentative conclusions. 
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However, a given qualitative J scale does not 
give a unique set of I scales, For example, 
with four stimuli, we may have the qualitative 
J scale A B C D. This order of four stimuli 
on a J scale can yield two different sets of I 
scales as follows: 


Set 1 Set 2 
ABCD ABCD 
BACD BACD 
BCAD BCAD 
BCDA ———CBAD 
CBDA CBDA 
CDBA CDBA 
DCBA DCBA 


It will be noticed that these two sets of seven 
I scales from the same qualitative J scale are 
identical except for the I scale from the middle 
interval. This arises from the following fact. 
There are six midpoints for the four stimuli 
on the J scale. These are as follows: AB, AC, 
AD, BC, BD, CD. The order and identity of 
the first two and the last two are immutable; 
they must be, in order: AB, AC, ii. ponn 
BD, CD. But the order of the remaining two 
midpoints is not defined by the qualitative 
J scale but by its quantitative characteristics. 
If the interval between stimuli A and B is 
Breater than the interval between C and D, 
then the midpoint AD comes before the mid- 
point BC and the set of seven I scales will be 
set 1 listed above. If the quantitative relations 
on the J scale are the reverse and the midpoint 
BC comes before AD, then the set of seven 
I scales which will result are those listed in 
set 2 above. 

Thus we see that in the case of four stimuli, 
à set of I scales will uniquely determine a quali- 
tative J scale and will provide one piece of 
information about the metric relations. For 
five or more stimuli the number of pieces of 
information about metric relations exceeds the 
minimum number that are needed for ordering 
the successive intervals. However, the particu- 
lar pieces of information that are obtained 
might not be the appropriate ones for doing 
this. It is interesting to note here that this is 
à new type of scale not discussed by Stevens. 
This is a type of scale that falls between what 
he calls ordinal scales and interval scales. In 
ordinal scales nothing is known about the in- 
tervals. In interval scales the intervals are 
equal. In this scale, which we call an ordered 
metric, the intervals are not equal but they 
may be ordered in magnitude. 


As the number of stimuli increases, the vari- 
ety of different sets of I scales from a single 
qualitative J scale increases rapidly. This means 
that a great deal of information is being given 
about metric relations. For example a J scale 
of five stimuli yields a set of eleven I scales 


n 
( in general n stimuli will provide ( ) +1 
2 


different I scales from one J scale ). Depending 


on the relative magnitudes of the four inter- 
vals between the five stimuli on the J scale, 
the same qualitative J scale may yield twelve 
different sets of I scales. This means that for 
a given order of five stimuli on a J scale 
there are twelve experimentally differen- 
tiable quantitative J scales. Previously, in the 
case of four stimuli, we found only two differ- 
entiable quantitative J scales for a given quali- 
tative J scale. 

The particular set of I scales obtained from 
five stimuli may provide up to five of the inde- 
pendent relations between pairs or intervals. 
For example, suppose we have the qualitative 
J scale A B C D E. Among the twelve possible 
sets of I scales which could arise are the fol- 
lowing two, chosen at random: 


Set 1 Set 2 
ABCDE ABCDE 
BACDE BACDE 
BCADE BCADE 
BCDAE BCDAE 
CBDAE———BCDEA 
CDBAE———CBDEA 
DCBAE———CDBEA 
DCBEA DCBEA 
DCEBA DCEBA 
DECBA DECBA 
EDCBA EDCBA 


Let us see what information is given by each 
of these sets about the relative magnitudes of 
the intervals between the stimuli on the J 
scales. Consider set 1 first. The order of the 
ten midpoints of the five stimuli according to 
set 1 is as follows: AB, AC, AD, BC, BD, CD, 
AE, BE, CE, DE. We know immediately, from 
the fact that the midpoint BC comes after AD, 
that the interval between stimuli A and B 
(AB) is greater than the interval between the 
stimuli C and D (CD). We have summarized 
this in the first row of the table below. The 
other rows contain the other metric relations 
which can be deduced from set 1. 
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SET 1 
Order of Relative Magnitude of 
Midpoints Intervals on J Scale 
AD, BC CD < AB 
CD, BE BC <DE 
BD, AE AB < DE 
CD, AE AG < DE 


Or, in brief form, the I scales contained in 
set 1 indicate that the following relations must 
hold between stimuli on the J scale. 


CD < AB < DE 

AB + BC < DE 
In the same manner we may study the impli- 
cations of set 2 for the metric relations be- 
tween stimuli on the J scale. The midpoints 


for this set are in the following order: AB AC 
AD, AE, BC, BD CD, BE, CE, DE. 


SET 2 


Order of 
Midpoints 


Relative Magnitude of 
Intervals on J Scale 


AD, BC CD < AB 
CD, BE BC <DE 
AE, BD DE < AB 
AE, BC CE < AB 


Or, in brief, the relative magnitudes of the 
intervals between stimuli on the J scale are 
known to the following extent. 


BC < DE < AB 

DE + CD < AB 
The different implications of these two sets 
of I scales for the metric relations on the 
J scale may be illustrated by sketching two 


quantitative J scales which have the appro- 
priate metric relations (Figure 5). 


The two sets of I scales which are illustrated 
here were only two of twelve possible different 
sets which could be generated from a single 
qualitative J scale of five stimuli. Each of the 
twelve sets of I scales would imply a different 
set of quantitative relations among the distances 
between stimuli on the J scale. The two sets of 
I scales used here happened to differ from each 
other in three of their particular members. If 
we take the twelve potential sets and make fre- 
quency distribution of the number of pairs of 
sets which have 1, 2, 3, 4, or 5 particular I 
scales different or 10, 9, 8, 7, or 6 I scales in 
common, we get the distribution shown in the 
table. 


Number of Identical 
Ordinal Positions in 
a Pair of Sets with 


the Same Qualita- Number of Such 


tive I Scale Pairs of Sets 
10 18 
9 24 
8 17 
7 6 
6 1 


(:) 
66 = 
2 
——— 
The surface has not even been scratched on 
the generalizations which can be developed. 
Enough has been presented here to provide a 
general idea of the type of information which 
can be derived. 


V. SUMMARY 


We have presented a new type of scale called 
an ordered metric and have presented the ex- 
perimental procedures required under certain 
limiting conditions to secure such a scale. 

We have pointed out that the information 
which could be obtained under these conditions 
is as follows: 


c D E 


A 


Figure 5. An example of two joint distributions with the same order of stimuli but 
different metric relations obtained from different sets of I scales. 
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1,-;The discovery of a latent attribute under- 
lying preferences. 

2. The order of the stimuli on the attribute 
continuum. 


3. Something about the relative magnitudes 
of the distances between pairs of stimuli. 


4. The sectioning of the continuum into in- 
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15. Latent Structure Analysis 


PAUL F. LAZARSFELD 


The present paper will analyze one special pro- 
cedure by which it is possible to make what one 
might call inferential classifications. Any num- 
ber of well-known topics are covered by this 
provisional name tag: a person’s attitude as in- 
ferred from his behavior, the intention of a 
document as inferred from certain linguistic 
characteristics, the morale of a group as inferred 
from its various performances, and many others. 

The matter can be reformulated in the fol- 
lowing way. Empirical observations locate our 
objects in a manifest property space. But this 
is not what we are really interested in. We want 
to know their location in a latent property 
space. Our problem is to infer this latent space 
from the manifest data. This reformulation of 
the relation between concept formation and 
classification by indicators has a number of ad- 
vantages. One of them deserves special atten- 
tion, 

In any empirical classification guided by con- 
ceptual considerations we try to overcome the 
accidental elements inherent in the use of indi- 
cators. Suppose we want to order people accord- 
ing to how they feel about the role of govern- 
ment in economic affairs. We might ask them a 
series of questions as to public ownership of 
railroads, mines, banks, etc. It is reasonable to 
assume that the more someone favors laissez 
faire the fewer of these items he will answer 
bro public ownership. Still we know that many 
individual idiosyncrasies will creep into the an- 
swers. A strong laissez faire person has just read 
about a mine accident and under this impact 
he gives a pro public ownership response to the 
mine item; a strong interventionist happens to 
know a very fine bank president and therefore 
excludes the bank item from his list of pro re- 


* Excerpted from S. Koch (Ed) Psychology: A 
Study of a Science, Vol. $, McGraw-Hill, New 
York, 1959, with permission of the author and the 
publisher, 


sponses. In the manifest property space we are 
at the mercy of these vagaries. But in the latent 
space, as we shall see, we can take them into 
account and thus achieve a more "purified" 
classification. 

We are now ready to turn to the one ques- 
tion which has still been left unanswered: how 
is the probability relation between the observed 
indicators and the intended classification estab- 
lished? How do we move from the manifest to 
the latent property space? 


THE LOGICAL FOUNDATION OF LATENT 
STRUCTURE ANALYSIS: A SYNOPSIS OF THE 
MAIN ISSUES 


Inferential classifications with the help of a 
set of indicators are nothing new in the world 
of science. A doctor who uses a series of tests 
to see whether a patient has tuberculosis, a psy- 
choanalyst who uses free associations to retrace 
a childhood experience, a chemist who observes 
various reactions to identify the nature of some 
substance—all use what might be called diag- 
nostic procedures. They know, or believe they 
know, laws and regularities which link their 
manifest indicators with their latent space. 
Their diagnosis applies previous knowledge to 
a specific new case. 


Some Initial Clarifications 


But there exists a second type of procedure 
where, so to say, the acquisition of general 
knowledge and its application to a specific case 
are performed simultaneously. This happens if 
the starting point of an investigation is a statis- 
tical one and if our attention is mainly focused 
on the covariation of indicators in a large num- 
ber of cases. The present section is devoted to 
a darification of this idea. It will help if we 
sketch the course of the following discussion by 
raising a number of questions and offering 
some preliminary answers. 
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1. With what kinds of manifest material shall 
we deal? They are qualitative, but to further 
simplify matters, they will be dichotomies 
through most of this report. Thus our exam- 
ples will be “yes” or “no” answers to an obser- 
vation. Does a man agree or disagree with a 
statement? Is he native or foreign born? Is a 
city above or below the national suicide rate? 

We shall call any piece of such information 
an “item.” In each case we shall have an item 
list in which items are numbered in an arbi- 
trary but fixed way. The number of items in 
this list coincides with the dimensionality of 
the manifest property space. 

One alternative of each dichotomy will arbi- 
trarily be called positive (+), the other negative 
(—). Often a judicious use of these designations 


will help in intuitively grasping the material as . 


a whole, Each object in our study will be char- 
acterized by a response pattern of the following 
kind: 
Response to Item No. 
1 2 3 ats m 
n a + 32: ct 


The term response pattern is taken from ques- 
tionnaire practice but is used here in a meta- 
phorical sense. The items might all be derived 
from observation, e.g., the behavior of a person 
in various situations, and the objects might be 
collectivities and not individuals. 

2. A whole group of “respondents” will be 
characterized by their response frequencies: 
they are the proportion of the group who an- 
swer each item i affirmatively (p,), two items (p,)), 
three items (p;x), etc. A barred index will be 
used to indicate a negative response. Thus p,53 
is the proportion of people who give an affir- 
mative answer to items 1 and 3 and a negative 
answer to item 2. The whole set of these re- 
sponse frequencies is called a "dichotomous 
system" and its nature is very important for a 
more detailed study of latent structure analysis. 
For our present purpose, acquaintance with the 
symbolism will suffice. 

3. How are we to represent a latent space? 
In the traditional way by a system of coordinate 
axes, ie, a so-called cartesian frame of refer- 
ence. In this section we shall restrict ourselves 
to one-dimensional latent spaces to facilitate 
exposition, But assessing the number of dimen- 
sions in the latent space is possible. A one- 
dimensional space is, of course, a straight line. 
We shall often call it a latent continuum. 

4. What is the relation between the mani- 


fest items and the latent continuum? It involves 
probabilities. Thus we shall assume that there 
exists 4 curve—preferably a mathematically 
simple one—which relates to each point of the 
latent continuum a specific probability that a 
given item has a positive response. Suppose that 
our latent continuum deals with socioeconomic 
status, and three of our items are ownership of 
a yacht, presence of running warm water, and 
presence of two living rooms in the house of 
the respondent. Common sense would let us 
suspect that the corresponding probability rela- 
tion will be somewhat like Figure 1. The graph 


1 Running warm water 


Item probability 


0 


Socioeconomic status ——- 


Figure 1. Probability relations between the intended 
classification by socioeconomic status and the ob- 
served frequency of three indicators. 


intimates that very quickly as we go up the so- 
cial scale almost every family will have running 
warm water; only the upper crust is likely to 
own yachts, while the probability of more space 
in addition to bedrooms increases fairly propor- 
tionally with socioeconomic status. The truth of 
these surmises is not relevant here. What mat- 
ters is the way they are expressed through Fig- 
ure 1. Of course we have not yet stated precisely 
what we mean by probability, and we are still 
in the dark as to how we would define and 
know a respondent's socioeconomic status. 
These two points soon will be taken up in con- 
siderable detail. 

The curves of probabilities in Figure 1 we 
shall call trace lines; they trace the probability 
for an item as a “respondent” moves along the 
latent continuum. If the latter is a two-dimen- 
sional one then the probabilities form a trace 
surface. In full generality we shall talk of latent 
traces. Notice that a trace line is defined for 
each item separately. Later a crucial problem 
will be what to think about the probability of 
joint responses to several items occurring simul- 
taneously at each point of the latent space. 

5. How are we to understand the term 
“probability” used all through the preceding 
pages? A traditional example of how to look at 


probability is as follows: we take a sample of 
people aged fifty and find out how many die 
within the next year; we compute the propor- 
tion of people who died between the ages of 
fifty and fifty-one. Then we generalize this ratio 
and say that it is the probability of dying at 
the age of fifty within one year. This operation 
can be refined as far as we want to go. We 
might say, for instance, that the probability of 
business executives’ dying within a year at the 
age of fifty is greater than the corresponding 
probability for office clerks. The class for which 
such probabilities are computed and then gen- 
eralized is usually called the reference class of 
the probability (3). 

This same procedure, however, can be used 
in still another way. Suppose we ask an indi- 
vidual, Mr. Brown, repeatedly whether he is in 
favor of the United Nations; suppose further 
that after each question we "wash his brains" 
and ask him the same question again. Because 
Mr. Brown is not certain as to how he feels 
about the United Nations, he will sometimes 
give a favorable and sometimes an unfavorable 
answer. Having gone through this procedure 
many times, we then compute the proportion 
of times Mr. Brown was in favor of the United 
Nations. This we could also call the probability 
of Mr. Brown's being in favor of the United 
Nations. But now the reference class is not 
many Mr. Browns having been asked this ques- 
tion once, but one Mr. Brown having been 
asked the question many times. 

There is one interesting consequence of this 
version of the probability notion. A specific Mr. 
Brown, for instance, might feel that it is his 
duty to be in favor of the United Nations. 
"Therefore, if he is asked a question when he is 
sober, his probability—or, if you please, his 
propensity—to be in favor of the United Na- 
tions might be rather high. Under the influence 
of alcohol, however, his hostility to the interna- 
tional organization might come out. Therefore, 
his probability under the influence of alcohol 
could be different than his probability if he 
were sober. 

How can we know trace lines? This is, of 
course, the central problem of latent structure 
analysis. The remainder of the present section 
Will provide some preparation, consideration, 
and examples for developing an appropriate 
answer, 

The whole configuration of trace lines for all 
items and the location of each object in the 
latent space is called the latent structure. It is 
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a typical example of what is often called a 
“mathematical model,” a construct which is de- 
rived from actual data together with certain 
general reflections on the purpose these data 
serve. In our case the situation is as follows: 
from our manifest data we actually know the 
frequencies in which the various response pat- 
terns occur in a given population; what we 
want to know are the latent parameters of the 
model, the coefficients which characterize the 
latent traces, and the distribution of the popu- 
lation within the latent space. We therefore 
need equations which link the manifest fre- 
quencies to the latent parameters. From these 
so-called accounting equations we then can 
compute all the elements in the model. The 
name given to these equations is meant to in- 
dicate that with the knowledge of the full latent 
structure, we can account for everything known 
about the manifest data. 

In order to clarify this basic idea, it is best 
to discuss in some detail two empirical opera- 
tions with which most research students are well 
acquainted. By a slight extrapolation they be- 
come basic elements of latent structure analysis. 
The situation is somewhat similar to what was 
just mentioned about the concept of proba- 
bility. Probabilities are formal extrapolations 
from the empirical notion of relative frequen- 
cies. In our case we are referring to item anal- 
ysis which forms the basis of the notion of trace 
lines, and to the “explanation” of statistical 
relations which becomes the basis for the ac- 
counting equations. 

We turn first to item analysis and trace lines. 


Item Analysis and Item Curves 


Every graduate student who takes a course in 
applied psychology knows about item analysis. 
If he wants to develop an attitude or a perform- 
ance test he knows that he should proceed in 
the following way. He is permitted to start with 
many questionnaire items which he hopes will 
be indicative of what in the end he wants to 
“measure.” But then he is supposed to distin- 
guish between good items and bad items. This 
he is taught to do in the following way: he 
forms a "raw" score by adding up for each re- 
spondent the number of items which are an- 
swered in the “correct” way. Then he plots each 
single item against this raw score. (This we shall 
call the item curves.) The items which have a 
high association with the raw score are accept- 
able. The items which have a low association 
are considered inappropriate and should be 
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eliminated. We will now give an example of 
such an item analysis, but we will refine it in 
two ways: 


1. We shall plot two items against the raw 
score. 

2. We shall not only plot each item sepa- 
rately, but we shall investigate how the associa- 
tion between the two items is related to the raw 
score. 


Item Analysis Applied Simultaneously 
to More Than One Item 


Our material comes from a public relations 
study where 560 respondents were asked ques- 
tions regarding their attitudes to the oil indus- 
try. Do oil companies treat their workers fairly; 
do they make too much profit; are they wasteful 
of our natural resources, etc.? To each question 
the respondent could give one of five answers, 
which ranged from firmly favorable to firmly 
unfavorable (from the oil industry's point of 
view). There were ten questions in all and eight 
of them were combined into an arbitrary score 
in the following way: a firmly favorable answer 
was given a weight of 4 and so on down to the 
firmly unfavorable, which got a weight of 0. 
Then all the weights were added so that a re- 
spondent's general attitude score could range 
from 0 to 32. This score was used as the “out- 
side continuum" or base variable. Against it 
the probabilities (proportion) of answers to the 
remaining two questions were plotted. These 
two were: 


Item 1. Do the big oil companies control too 
much of the oil business? 

Item 2. Is the oil industry wasteful of our 
natural resources? 


A positive reply (from the industry point of 
view) was one in which a respondent expressed 
at least some disagreement (score class 3 and 4). 
The joint positive response required such dis- 
agreement with both items. In Table 1 we now 
have the data for the item curves for each item 
alone and for the joint responses. The vertical 
marginals of the five partial fourfold tables in- 
dicate the item curve of item 1. In the lowest 
general attitude group, 17 out of 106, or 16 
per cent, give an affirmative response. In the 
group farthest to the right, 84 out of 103, or 84 
per cent, do so. The horizontal marginals indi- 
cate the item curve for the second item; the 
corresponding figures are 31 per cent and 79 
per cent. Thus item 1 (concern with economic 
control) is more expressive of the "underlying" 
classification than item 2 (concern with natural 
resources). Figure 2 shows the item curves for 
items 1 and 2 and adds a third: the proportion 
of people who give a positive answer to both 
items. These proportions are based on the left 
upper corner figures of each of the five fourfold 
tables in Table 1. Note that the item curve for 
both items is more concave (seen from the top) 
than either of the curves for items 1 and 2 
separately. 

The items in our attitude tests have obviously 
been selected by the investigator as indicators 
of an underlying continuum. The item curves 
are a crude representation of the relation be- 
tween these indicators and the intended classi- 
fication. But what about the relation among 
the indicators? The reader will remember that 
we came to the general expectation that indi- 
cators will be statistically related to each other 
because they have their links with the under- 
lying continuum in common. In terms of prob- 


TABLE 1. The Interrelation of Two Test Items for Five Subclasses of 
Respondents Classified According to a General Attitude Score 


Derived from Eight Other Questions* 


Item 2 Item 2 
+ = + -= 
e [sn] 7g + [15] 27, 
^- 897 — 
Total 33 73 106 47 88 135 
General 
Atti- 
tude 0 to 16 17 to 20 
Score 


Item 2 
+- - + - 
28 [s [25] eig +[ 40] 18] 58 e [e| v | 84 
M Maru 
69 55 124 


21 to 23 


Item 2 Item 2 


+ 


55 89 92 79 24 108 


24 to 26 27 to 32 


* A response favorable to the oil industry is indicated by a + sign. 
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Figure 2. Item curves corresponding to Table 1. 


ability notions we can now put it this way: the 
probability of joint occurrence pj, will not be 
PiPo—the chance result of two independent 
probabilities—but, rather, py, will be greater 
than pp. In the empirical data we shall expect 
a positive association in the fourfold table, 
which cross-tabulates the reply to two items. 
This turns out to be the case in our public re- 
lations example, as can be seen from Table 2. 

Table 2 is obtained by adding the five par- 
tial tables in Table 1. 


TABLE 2. The Interrelation of Items 1 and 2 
of Table 1, for All 560 Respondents 


Item 2 
+ 


+ [opre] 
Item 1 
- [m | m 28 


981 279 560 


Here p,a = 160/560 = .29, whereas p; * po is 
only (.47)(.50) — .24. Still, using rather informal 
language, we can say that the responses to two 
items of a test show positive relations because 
they were chosen as indicators of an underlying 
property. But this argument can be turned 
around. If a class of people are alike in an un- 
derlying property, then the indicators of this 
property should not be statistically related in 
this class. In our example we can submit this 
idea to a crude test. Our general score was sup- 
posed to be a crude measure of the general atti- 
tude of the respondent to the oil industry. By 
dividing the respondents into five classes, as in 
Table 1, we get groups of people who among 
themselves have a rather similar attitude. 

In general our expectation is borne out. We 
now have five partial fourfold tables: the asso- 


ciation is negative in one, practically zero in 
two, and positive in two. The five tables can be 
looked upon as chance variations from an asso- 
ciation which is actually zero. Extrapolating the 
results of Table 1, we can say that if by an 
appropriate score, the underlying property of 
a population is kept constant, then the indica- 
tors of the property are statistically unrelated.! 

The essence of our example is this. On the 
basis of their raw scores, we divided our respon- 
dents into five classes. Within these classes raw 
scores are relatively similar, and this similarity 
is understood to correspond to the similarity of 
individuals within a class in their general atti- 
tude toward the oil industry. Just as individuals 
within a class have a similar attitude, individ- 
uals in different classes have dissimilar attitudes. 
This dissimilarity manifests itself in the differ- 
ences between probabilities of affirmative re- 
sponses in the different classes. Within each 
class the probabilities for all people are the 
same for any one item; of course, different items 
will generally have different probabilities of 
affirmative response within a given class. Con- 
sidering the responses to a single item by indi- 
viduals in one class, we still find a mixture of 
positive and negative responses. After all, the 


1 Whether the association or correlation between 
items 1 and 2 in such subclassifications can be 
considered a result of chance can be tested by x? 
procedures. It should be desirable to obtain more 
such examples, because the present one points to 
an interesting possibility. The five associations go 
uniformly from negative to positive as we move 
from low to high general scores. If this turns out 
to be the case in other tests, we would be con- 
fronted with a result in test psychology which 
deserves further investigation and interpretation. 
For our present purpose this matter is irrelevant, 
because we use our concrete data only to lead up 
to an axiomatic idealization. 
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class does not determine the response; it only 
determines the probability of each response. The 
variability of response is supposed to stem from 
accidental elements. Quite irrespective of their 
attitude toward a specific industry, some people 
happen to be more concerned with the preser- 
vation of natural resources, others are more 
worried about the growth of economic monop- 
oly. Biographical and other reasons might ac- 
count for such a difference; in any case, these 
idiosyncratic elements are assumed to be unre- 
lated to each other. Within a class which is ho- 
mogeneous in regard to its basic attitude, the 
answers to specific items are assumed to be un- 
related. This was not quite the case in our con- 
crete example but was enough so that an 
extrapolation seems indicated. We shall define 
a homogeneous class as one in which this sta- 
tistical independence of indicators prevails. 

This leads us to investigate the characteristics 
of a group of respondents which can be con- 
sidered a mixture of subgroups where, within 
the subgroups, a set of indicators are statisti- 
cally independent of each other. 


The “Mixture” Phenomenon and Its Role in 
the Explanation of Statistical Relations 


We start with a simple case. Suppose that in 
each of three groups two items are statistically 
independent. Table 3 should be looked upon as 
an idealization (and simplification) of Table 1. 
We “mix” these three groups and form a new 
one by adding box by box the corresponding 
numbers on the left side of Table 3. On the 
right side we now find an association between 
the two items, which did not exist in the three 
partial tables to the left. Where does this asso- 


ciation come from? We understand this best if 
we look at the margins of the three left-side 
parts of Table 3. In Class I the probability of 
a positive response is much lower on both attri- 
butes than in Class III. If in the combined pop- 
ulation we select successive respondents, they 
will sometimes come from Class I and some- 
times from Class III. In the former case, they 
will be more likely to give negative responses 
on both attributes; in the latter case, both re- 
sponses are more likely to be positive. The sta- 
tistical association between the two items in 
the total population is thus accounted for by 
the fact that each attribute by itself is positively 
related to the general attitude of the respon- 
dents which distinguishes the three subclasses. 
This was, of course, also the case in our pre- 
vious example of the public relations study of 
the oil industry. Table 1 shows that, for all the 
items, the probability of a positive response in- 
creases with the general attitude score. The 
positive association between the two items in 
Table 2 is therefore accounted for by the fact 
that they are both indicators of an underlying 
attitude or, more precisely, the probability of 
positive responses is positively related to the 
general attitude score. The resulting association 
in Table 3 is noticeable but not very strong. 
Therefore we increase the marginal differences 
between the three homogeneous classes and mix 
again. Now the resulting association is much 
more marked. How is this finding to be ex- 
plained in the light of the previous discussion? 
It will be remembered that the marginals in 
the partial subtables correspond to the “item 
curves” of the two items. In Table 4 they are 
clearly much steeper than in Table 3. This 
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means that now the two items have a much 
stronger relation to the underlying continuum 
than before. As a result the interrelation be- 
tween the two indicators on the right side of 
Table 4 is much stronger than in Table 3.2 

So far our emphasis has been on the mixing 
of homogeneous subgroups and the resulting as- 
sociations between indicators. But Tables 1, 3, 
and 4 can also be read in the opposite direc- 
tion, from right to left. We then start with an 
existing association between indicators; we “un- 
mix” the population under study and end by 
showing the homogeneous subgroups in which 
the associations disappear. Actually, this is al- 
ways done if, in empirical research, a statistical 
finding is to be explained. We want to remind 
the reader of the three major types of such 
explanations. 

Type 1. A good example is available from 
political research, In a presidential election ed- 
ucated people vote more frequently than the 
uneducated. We can classify people into three 
groups, however, according to their interest in 
elections. Then on each interest level we can 
set up a fourfold table between voting and edu- 
cation (graduation from high school forming 
the point where higher education begins). We 
then find that with each increase in interest 
the proportion of voters increases, as well as 
the proportion of people having higher educa- 
tion. Within interest groups, however, there is 
practically no relation between education and 
voting. Interest, therefore, accounts for the orig- 
inal relation in terms of what is usually called 
an “intervening” variable. The whole structure 
can be represented by the following scheme 
where arrows stand for a vague idea of causa- 
tion, 


Education > Interest > Voting 


The original two variables are underscored. 
Their association is interpreted through the 
role of “interest.” The interpretation is tested 
by showing that the original association disap- 
pears within subgroups which are homogeneous 
in regard to interest. 


2 The reader should satisfy himself that many 
other combinations could occur. Suppose, €g., that 
we made Class III in Tables 3 and 4 much larger 
than Class I. Then, in the resulting fourfold table, 
both items and their joint occurrence would show 
higher frequencies. 

? Cf. [I]. There a characteristic counterexample 
is included. Men vote more frequently than women. 
This cannot be accounted for by interest. Even 
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Type 2. The second major type of accounting 
is usually known as the controlling of spurious 
factors. 

Examples are almost proverbial: fires where 
many fire engines come out cause more damage; 
does this mean that fire engines are dangerous? 
Obviously not. Large fires bring out many en- 
gines and cause much damage. The arrow 
scheme corresponding to this case would be as 
follows: 


Amount of 
equipment 


Damage 


"A 


Size of fire 


If the size of the fire is “kept constant" there 
would be no positive relation between equip- 
ment and damage. 

In both types of accounting the statistical 
test is the same, and it is the one which we have 
carefully analyzed above. The ultimate relation 
between two attributes is owing to the fact that 
they are both related to a third property; once 
this property is kept constant the original rela- 
tion disappears. The difference between type 1 
and type 2 lies in the sequence of variables 
involved. In both cases we start with an associ- 
ation between two factors: education and voting 
in the first example, equipment and damage 
in the second. But in type 1 the explanatory 
factor intervenes between the two original vari- 
ables; whereas in type 2 it antecedes the dam- 
age as well as the number of engines the fire 
brings out. 

Type 3. The third type of accounting is 
usually less discussed because its outcome seems 
so obvious from a substantive point of view. 
Still it is the most important one for the pres- 
ent purpose, and all our initial examples be- 
long here. When we deal with indicators of a 
supposed underlying property, there exists no 
necessary time relation between the intended 
classification and its overt manifestations. 
Rather the relation here is one of generality 
and specificity. Still, the test of whether we are 
really dealing with appropriate indicators is the 
same as before; we want to know whether the 
underlying property does account for the inter- 
relation between the manifest indicators. We 
would look for a way to classify people accord- 
ing to the underlying characteristic and assume 
that if this is held constant no further statistical 


within the same interest group men vote more 
than women. 
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relation should exist between the various indi- 
cators. 

The most obvious way to make this test is to 
see whether people who are alike on the ma- 
jority of the indicators show any appreciable 
relation between the remaining ones. This we 
did with our oil study example. Another ap- 
proach would be to use a rating scale. For in- 
stance, people could be asked to rate their 
political interest on a scale from 1 to 10. Then 
they could be divided into fairly homogeneous 
classes according to this self-rating. If we then 
have an itemized interest test, we could raise 
this question: does the self-rating account for 
the interrelation between the test items? The 
answer would be in the positive if, on each 
level of self-rating, the items were not statisti- 
cally associated. The arrow scheme correspond- 
ing to that of the previous example would be: 


Item 1 Item 2 
self-rating 
on interest 


Item 3 <— 


It is hoped that we have given enough exam- 
ples so that the reader has a clear picture of the 
following two related facts: 


1. There exists a uniform operation of ac- 
counting for an empirical relation between two 
properties. It consists of studying this relation 
for subclasses of the original population, these 
subclasses being formed by the introduction of 
additional properties. The substantive nature 
of these “accounting properties” and their rela- 
tion with the original data make for the main 
types of accounting procedures as they occur in 
the practice of research. 

2. These various accounting procedures are 
all in fact inversions of the “mixture phenom- 
enon” described above. In mixtures of homo- 
geneous groups, indicators show statistical 
associations: they are due to the covariation of 
the indicators between these subclasses. In- 
versely, associations between indicators in em- 
pirical populations can be accounted for by 
dividing them into homogeneous subgroups: 
the variables along which this “unmixing” can 
be done “explain” the statistical associations 
originally found. 


We can now relate all these considerations to 
latent structure analysis. 


The Accounting Equations and the 
Principle of Local Independence 


We are prepared to answer the question 
which we first raised at the end of the historical 
survey in the first section. There we came to 
the conclusion that the problem of disposition 
concepts boiled down to the task of relating a 
manifest to a latent property space. 'The mani- 
fest space was given by the observed properties 
of our objects, which for our present purpose, 
we have reduced to dichotomies. We came to 
use the word “item” for an indicator and the 
term "response" for its observed presence or 
absence. A "response pattern" was a point in 
such a dichotomous property space. 

The latent space corresponds to our intended 
classification, which, as we saw, was variously 
called in the literature "an underlying charac- 
teristic,” "a trait,” "a disposition,” etc. It was 
not necessary to assume anything about this 
latent space; the number of its dimensions was 
not specified, nor did they need to be of any 
particular mathematical form, dichotomous, 
continuous, or whatever. But when we came to 
this point of our discussion we left it unde- 
cided how we would achieve this latent classi- 
fication, in view of the fact that we have only 
manifest observations available. What solution 
does latent structure analysis propose for this 
problem? 

It defines the latent space as that classifica- 
tion which accounts for the statistical interrela- 
tions between the manifest observed indicators. 
It is the classification which "unmixes" a given 
population into homogeneous subgroups. The 
many consequences of this definition must now 
be spelled out. 

The latent space is not known in advance 
but is defined by its accounting role. When we 
discussed examples of item curves, we first di- 
vided people by some index (e.g, a rating on 
the number of positive responses) into fairly 
homogeneous classes and then studied empiri- 
cally how the indicators were related in the 
various subclasses; Table 1 was a typical exam- 
ple. But in latent structure analysis we do not 
have an empirically provided general classifica- 
tion against which the occurrence frequencies 
or probabilities of the different items can be 
plotted. The underlying classification is derived 
from the statistical behavior of the indicators 
themselves. The sequence of affirmative re- 
sponse proportions for a given item over all 
homogeneous "latent" classes becomes its item 
curve; it is now called a trace line to stress that 


»o« 


it is not directly given but derived from empir- 
ical data. 


Accounting Equations 


Let us assume that we have ¢ homogeneous 
classes and n items. Let us assume further that 
these homogeneous classes are ordered in some 
way. (In Table 3, for instance c= 8, and the 
three classes are ordered from left to right.) Let 
us now focus on two specific items, say, the two 
items in the scheme just mentioned. The re- 
sponse frequencies of the two items in the com- 
posite population (exemplified by the right side 
of Table 3) can be derived from the following 
equations: 

c 
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This is in algebraic form the box-by-box sum- 
mation we have carried out in all our mixing 
examples. The superscript in p,” shows from 
what class x the “latent probabilities” have been 
taken; v? is the proportion of the whole popu- 
lation in class x. Suppose now the joint fre- 
quencies on the left were given and the task 
consisted in computing the response frequen- 
cies in the homogeneous subclasses. We could 
not solve it because there are fewer equations 
than unknowns. Obviously, however, we could 
add more equations by adding more items. Not 
only would that give us more equations of the 
type just mentioned, but it would also add an 
additional type: we could now set up equations 
for higher-order frequencies, for instance, 


e 


Diss = E: v? p," pa" Ps" 
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In general there always occurs a point where 
we have enough equations to solve the whole 
problem. Equations of this type are called ac- 
Counting equations because they permit us to 
derive parameters of the latent structure from 
the manifest data. These accounting equations 
are, in a way, the mathematical summary of 
everything we have said so far. Let us review, 
therefore, how they are related to the different 
elements of our discussion. 

First, they formalize algebraically the diag- 


LATENT STRUCTURE ANALYSIS 133 


nostic procedure by which we make the infer- 
ence from the manifest data to the latent 
position of a respondent. An indicator or test 
item is introduced because we have a more or 
less vague idea how it is likely to be related to 
what we want to find out about each of our 
respondents. We assume that if we could by 
some manipulation put people into various po- 
sitions of this intended classification, their re- 
sponse probability would vary according to this 
general image. The latent probabilities give 
precise expression to the relation between the 
latent and the manifest space. They are tanta- 
mount to a diagnosis for any manifest response 
pattern which might be empirically observed. 

But, of course, we cannot move respondents 
into various positions; we observe each respon- 
dent only once. Instead, we have a variety of 
respondents whom we assume—in the spirit of 
the whole model—to be actually at different 
places in the latent structure. As a matter of 
fact, we make an even stronger assumption: for 
the purpose of our model, we assume that all 
our respondents are alike but for one fact, that 
they are different in regard to the latent prop- 
erty. This is the second element in the whole 
analysis. 

A third element is the following trend of rea- 
soning. Even if we knew where a respondent 
belongs in the latent space, we would have to 
make him respond repeatedly to each item so 
that we could ascertain empirically his response 
probability. But again we must remember that 
each respondent is observed only once on each 
of the items. This difficulty is surmounted by 
the idea that in an empirical population we 
are most likely to have many people who are at 
the same point in the latent space. Now we 
consider such respondents to be, for our pur- 
pose, identical; therefore, the proportion of af- 
firmative answers in such a homogeneous group 
can be taken to be the same as we would have 
obtained if we had observed one member of 
each of these groups repeatedly. 

Let us get this series of constructions clearly 
in our minds by visualizing the process in re- 
verse order. We could get all our trace lines by 
the following procedure: 


1. We imbue one respondent, by some kind 
of manipulation, with various amounts of the 
latent property. 

2. At each point, we make him respond to 
each item repeatedly with “brainwashing” in- 
serted between any two trials. This would give 
us, at each point of the latent space, the prob- 


134 ATTITUDE MEASUREMENT 


ability of an affirmative response for our "typ- 
ical" subjects. 

3. The totality of these probabilities, at- 
tached to each point of the latent space, would 
be the trace of an item—in the one-dimensional 
case, the trace line. 


Now the steps (1) and (2) are replaced by 
the fact that at each point of the latent space 
we have many respondents; we substitute their 
response frequencies for the probabilities we 
are looking for. But remember that even this 
is a fiction. Although we are convinced that our 
whole populaton can be subclassified into such 
homogeneous groups, when we deal with con- 
crete respondents we do not know at what 
point of the latent space they are. Here we take 
advantage of the accounting equations just ex- 
plained and developed. What we actually know 
are the response frequencies of a mixed popula- 
tion to a number of items in all their combina- 
tions. Therefrom we can compute the response 
probabilities in the postulated homogeneous 
subclasses. As a matter of fact, we learn, as a 
by-product of this computation, what propor- 
tion of our respondents is in each of these 
latent classes. 

But notice that even now there is one topic 
which we have not discussed at all, namely, the 
single respondent. We do not know at which 
point of the latent space he is located. This is 
a matter which we will take up only in the 
next section. What we now know is the latent 
structure, the proportion of people in each class 
and the conditional probabilities of giving an 
affirmative response to each item in these classes. 

This whole web of assumptions and deduc- 
tions can be fruitfully divided into three sec- 
tions. One has to do with rather conventional 
ideas which are accepted wherever probability 
notions are introduced; the idea, for instance, 
that for the purpose of a specific investigation 
different people can be considered as alike, and 
that the proportion having a property is an es- 
timate of the probability that a single one of 
them will exhibit it. Although the logical foun- 
dations of this idea are by no means simple, we 
need not justify them here because of their 
general acceptance in all model building. 

A second group of our ideas has to do with 
the problem of unmixing: deriving the prob- 
abilities in homogeneous subclasses from the 
response frequencies of a mixed population. 
This is straightforward algebra and does not 
require any further logical foundation. Actu- 
ally, it is the most characteristic and novel 


aspect of latent structure analysis. Here the ac- 
counting equations come in. 

Finally, we have the principle of local inde- 
pendence. (The term has been suggested by 
Frederick Mosteller.) It covers the phase of our 
discussion in which an intended classification 
(an underlying, intervening variable) is defined 
as the one which divides a given population 
into homogeneous subgroups. The principle of 
local independence identifies the “measure- 
ment" problem with the mixing phenomenon 
or, rather, its inversion—unmixing. For this, no 
further foundation can be introduced. The 
principle is proposed as a mathematical axiom 
which formalizes the basic assumption of what 
we have called index formation in the social 
sciences. If an investigator chooses a number of 
indicators for the purpose of diagnosis, for the 
purpose of putting people or social objects into 
an intended classification, he does assume— 
knowingly or not—that the statistical relations 
between these indicators are essentially owing 
to the fact that they are all related to the in- 
tended latent property. For a group of people, 
therefore, who are alike in regard to this latent 
property, all the indicators will be statistically 
unrelated. This principle partakes of the com- 
mon characteristics of all axioms which are in- 
troduced into a theory—and no theory exists 
without at least one axiom.* 

The idea of making the principle of local 
independence the nub of index construction, 
even of concept formation in the social sciences, 
is the central logical feature of latent structure 
analysis. Together with conventional probabil- 
ity notions and some newly developed but quite 
orthodox algebra, all procedures and all em- 
pirical findings derive from it. 


THE NINE STEPS OF LATENT STRUCTURE 
ANALYSIS 


A latent structure analysis of necessity in- 
volves a certain sequence of operations which 
can be cast into a schedule of nine steps. 


Summary of the Nine Steps 


First we must think about the form of models 
which might reasonably be appropriate. This 
means that we want to consider systems of mani- 
fest and latent variables such that their inter- 
relations mirror the interrelations between 


4 Frederic Lord in discussing the principle of local 
independence has aptly stated that it is “almost 
indispensable for any theory of measurement.” 


ct. (2). 


indicators in the data and the concept which is 
the real object of concern. Having considered 
these questions in a general way, we must state 
our assumptions in explicit mathematical form 
(step 1). Then we can write the accounting 
equations, which give the relations between 
manifest and latent parameters, in the particu- 
lar form which the chosen model imposes on 
them (step 2). Next we must ask what condi- 
tions or restrictions are put on the interrela- 
tions within the data by the assumptions of the 
model (step 3). These “conditions of reduci- 
bility” are useful in a number of ways: first, 
they are explicit statements of relations which 
must hold among the manifest parameters, so 
that by means of simple operations on the data, 
and without solving the accounting equations, 
we can determine whether the assumed model 
is appropriate for the given data. Second, the 
conditions of reducibility are useful in evaluat- 
ing how closely the data are in accord with the 
requirements of the model. Third, the condi- 
tions of reducibility contribute to an under- 
standing of the model and to the question of 
the solvability of the accounting equations. 

This question of the solvability of the ac- 
counting equations may be asked more specifi- 
cally in the form: given the manifest param- 
eters, are there a sufficient number of condi- 
tions imposed by the model to make it possible 
to identify the latent parameters? (step 4) Hav- 
ing answered this question, we proceed to its 
logical corollary: if the equations are solvable, 
how does one actually solve them? (step 5) 

Up to this point everything is algebra. Now 
the data must be introduced, and we are forced 
to do some arithmetic. A “fitting procedure” 
(step 6) in latent structure analysis is usually a 
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shuttling back and forth between data and la- 
tent parameters—using data of lower order to 
identify certain latent parameters, from these 
computing what the data would have been if 
they had fitted the (partially identified) model 
perfectly, then combining these "fitted. data" 
with higher-order manifest data to compute fur- 
ther latent parameters. A fitting procedure has 
two goals: (a) a set of latent parameters and (b) 
a set of “fitted manifest parameters” which are 
perfectly in agreement with the demands of the 
model and at the same time are as close as pos- 
sible to the actual data. How close a fit was 
achieved requires some evaluation (step 7). Two 
questions must be answered here: Are the dif- 
ferences between the actual and the fitted pa- 
rameters small enough? Do the differences ap- 
pear to be randomly distributed, or do they fall 
into some pattern which suggests that a some- 
what different latent structure model would be 
more appropriate? If this is the case, we must 
again start from scratch, except for what we 
have learned by the experience. But if we are 
satisfied with the fit, there is still some work 
to be done. 

We want to know how the respondents who 
give a particular response pattern are distrib- 
uted over the latent space (step 8). And it is of 
interest to ask about the most likely or the most 
typical location of individuals who gave a par- 
ticular response pattern, and in some way to 
assign a score to each respondent, or to each 
response pattern. Similarly we may ask how 
much each item contributes to the diagnostic 
process, and perhaps we may wish to give each 
item a score indicating its ability to discriminate 
between individuals at different points of the 
latent space (step 9). 
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16. The Perception of Social Attitudes 


SAMUEL J. MESSICK 


The attitudes of others may be perceived as 
organized in different ways by different groups 
of people. Consider an attitude area composed 
of attitude toward war, attitude toward capital 
punishment, and attitude toward the treatment 
of criminals (correction vs. punishment). Cer- 
tain people may perceive others as varying from 
an extreme humanitarian to a correspondingly 
extreme nonhumanitarian viewpoint. In such a 
situation some particular person might be seen 
as being antiwar, anticapital punishment, and 
procriminal correction in the sense that one 
attitude directly implies the other and that no 
alternatives are possible (see Figure 1). Such a 


-W +W 
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Figure 1. Conceptualization of the attitudes toward 
war, toward capital punishment, and toward treat- 
ment of criminals as a unidimensional system. 


person might be called a "humanitarian." In 
the same type of schema, some other person, 
who would be called inhumane, might be seen 
as being prowar, procapital punishment, and 
anticriminal correction. This would be a one- 
dimensional system, since the attitudes are not 
perceived separately but as one composite “‘hu- 
mane-inhumane” attitude. People with such a 


* Reprinted from Journal of Abnormal and Social 
Psychology, 1956, 52, 57-66, with the permission of 
the author and the American Psychological Asso- 
ciation. This study was supported in part by the 
Office of Naval Research contract N6onr-270-20 
with Princeton University and also in part by funds 
from the National Science Foundation. The opin- 
ions and conclusions expressed are, of course, those 
of the author and do not represent the views of 
the Office of Naval Research or of the National 
Science Foundation. The initial portion of this 
study was carried out when the author *was an 
Educational Testing Service Psychometric Fellow 
at Princeton University. 
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point of view could not conceive of a person 
who was both antiwar and procapital punish- 
ment. 

Another group of people may perceive the 
attitudes of others as being structured in a dif- 
ferent way (see Figure 2). In this type of struc 
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Figure 2. Conceptualization of the attitude toward 
capital punishment combined with the attitudes 
toward correction of criminals while the attitude 
toward war is seen as an independent dimension. 


ture capital punishment of extreme criminals is 
equated with punishment of criminals in gen- 
eral as a deterrent to crime, but war is seen as 
something independent. This dimensional con- 
figuration provides a “frame of reference” in 
which Type B finds it possible to conceive of 
people in each of the four quadrants. In this 
perceived structuring, two pacifists could be 
conceived of as having widely different views 
toward capital punishment. Although they are 
both viewed as being against war, one could be 
seen as in favor of capital punishment as à 
protection to society and a deterrent to crime 
and the other as against it because "it is just 
as uncivilized as war." Type B would also feel 
it possible for two people to be violently anti- 
capital punishment and yet have different views 
toward war, because war is considered to be 
essentially independent of attitudes toward 
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criminals. It would be difficult under these cir- 
cumstances, however, for Type B to conceive of 
a person who was in favor of punishing minor 
criminals and at the same time against capital 
punishment for extreme criminals. 

In Type C war and capital punishment are 
equated as two means of taking human life, but 
criminal correction is seen as something inde- 
pendent (Figure 3). Thus, Type C might con- 
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Figure 3. Conceptualization of attitudes toward war 
and capital punishment combined as two means of 
taking human life while the attitude toward correc- 


tion of criminals is seen as independent. 


ceive of two religious conscientious objectors as 
being against both war and capital punishment 
because killing is involved, and yet one may be 
considered procorrection for humanitarian rea- 
sons and the other propunishment because evil- 
doers must pay. 

For Type D these three attitudes are viewed 
as being independent of each other, and it is 
possible for this group to conceive of people in 
each of the octants (Figure 4). Of course, the di- 
mensions in any of the above types may be re- 
lated in various ways—the lines drawn at angles 
instead of perpendicularly. 

It must not be construed from the above dis- 
cussion that a given group of people perceive 
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Figure 4. Conceptualization of the attitudes toward 
war, toward and toward the 
Correction of criminals as three completely inde- 
bendent dimensions. 


the attitudes of others in only one way. The 
above types may be considered to be “points of 
view,” and it is certainly possible for an indi- 
vidual to hold several different points of view 
with respect to the attitudes of others, depend- 
ing upon the characteristics of the “other” peo- 
ple considered. Some individuals may be able 
to conceive of a different attitude structure for 
every group of people specified, because they 
perceive different types of people in different 
ways. However, it is also possible that these in- 
dividuals perceive one particular attitude struc- 
ture more often than another, and that in 
the absence of specific knowledge or a detailed 
characterization of the other person, they may 
ascribe to his attitudes this modal structure, It 
is this modal, or dominant, way in which indi- 
viduals generally structure the attitudes of 
others with which the present study is con- 
cerned. 

In psychophysics multidimensional scaling 
methods (12, 13) have been developed which 
yield dimensional configurations for psycho- 
physical judgments. "These methods may be 
promising for the analysis of attitudes. The 
fundamental concept involved in these tech- 
niques is psychological distance, a construct 
that has been widely used by many psycho- 
logical theorists. Psychological distance in the 
attitude realm can be thought of in terms of 
the degree of agreement or disagreement shown 
between two statements. If a person who 
strongly agrees with one statement would also 
be likely to agree strongly with another, then 
these two statements can be considered to be 
psychologically close together. If a person who 
strongly agrees with one statement would not 
be very likely to agree with the other, then 
those two statements can be considered to be 
psychologically far apart. If the psychological 
distances among attitude statements can be 
analyzed in a Cartesian space, it will be pos- 
sible to obtain a configuration of the way in 
which an individual perceives attitudes as be- 
ing structured in a given domain. 


THE PROBLEM AND AN APPROACH TO ITS 
SOLUTION 


As was seen in the introduction, it is possible 
on a priori grounds to structure the attitudes 
of others in terms of several different, yet rea- 
sonable, dimensional arrangements. The actual 
attitude arrangements perceived by individuals, 
however, has seldom been investigated, mainly 
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because of a lack of appropriate methods. 
Knowledge of the ways in which individuals 
perceive attitudes as being structured in oth- 
ers is important psychologically, since such 
perceptions affect an individual's understand- 
ing of others, his relationships and adjustments 
with others, and his actions toward others. It 
might be possible to obtain some information 
about these perceived attitude structures by 
utilizing multidimensional scaling techniques. 
The purpose of this study, then, is first of 
all to see whether a set of perceived attitude 
relationships can be adequately represented in 
dimensional terms and, secondly, to see if two 
groups, which probably differ with respect to 
these attitudes, perceive them as being struc- 
tured in different ways. The adequacy of the 
dimensional representation can be evaluated 
in terms of the requirements of the multi- 
dimensional psychophysics model (see 7, 13, 
18). This model provides for a Euclidean 
dimensionalization of the data, and if the data 
do not have Euclidean properties, one of the 
requirements of the model will not be met, 
ie. the matrix of distance scalar products (see 
7, 18), which must be positive semi-definite 
for Euclidean distances, will be found to have 
large negative roots. Therefore, if large nega- 
tive roots are found, the suitability of the model 
is in question, and dimensions should not even 
be extracted. Unfortunately, however, statisti- 
cal tests for the size of latent roots are not 
available, so arbitrary acceptance regions must 
be set (7, 18). Under such conditions, it might 
sometimes be possible for a matrix to be dimen- 
sionalized when it is actually non-Euclidean. 
Even in this case, however, there are indica- 
tions of the unsuitability of the model, for a 
forced Euclidean description of non-Euclidean 
distances would lead to an increase in dimen- 
sionality. Thus, whenever the number of di- 
mensions extracted approaches the number of 
stimuli, not only is the fruitfulness of the ap- 
proach in that area in doubt, but the suitabil- 
ity of the model is also in question. 
Multidimensional scaling methods have been 
utilized by Richardson [method of triadic com- 
binations (10)] and Torgerson [method of com- 
plete triads (13)] which could be applied for 
this purpose. These methods are essentially 
extensions of the method of paired compari- 
sons to the multidimensional case. The method 
of complete triads could be applied to attitude 
Statements as it stands, but it has the un- 
desirable property that for m statements 


n(n —1)(n —2)/2 judgments would be re- 
quired from each S. The task becomes prohibi- 
tive with more than 10 statements. It would 
be more reasonable to apply a multidimen- 
sional extension of the method of successive 
intervals (3, 11), which would require only 
n(n—1)/2 judgments The procedure for 
such a method would require $ to arrange 
n(n — 1)/2 pairs of stimuli on a distance con- 
tinuum according to the degree of similarity 
of the members of each pair. This question of 
similarity or relatedness can be approached in 
the attitude realm by asking for a judgment 
of the agreement or disagreement shown be- 
tween two statements. The judgment involved 
can be set up in terms of the attitudes of some 
other person in the following way: S is asked to 
imagine the type of person who would strongly 
agree with Statement A and then to decide 
how this same person would also feel about 
Statement B. In this way, S is asked to esti- 
mate the attitude relationships of others, and 
the multidimensional analysis of these data 
will provide a dimensional configuration of 
these perceived attitude relationships. Minimal 
information about the "other" person is pro- 
vided in this procedure, i.e., he is characterized 
only as being in complete agreement with a 
single statement. Perhaps, in this highly un- 
structured situation, this agreement will act as 
a cue to the complex characterization which 
S most often ascribes to other people in gen- 
eral, and S$ will respond in terms of this modal, 
or dominant, characterization. 

Since there is a persistent correlation be- 
tween a person's own attitudes and his esti- 
mates of the attitudes of others (5, 14, 16), the 
procedure suggested here for investigating the 
structure of perceived attitudes bears certain 
similarities to projective instruments. However, 
such correlations characteristically range from 
4 to .6. Such a relationship seems to indicate 
the possibility of assessing a person's own atti- 
tudes by giving him the seemingly objective 
task of estimating the opinions of others, but 
this is not the intent of the present experiment. 
The suggested procedure as it stands does not 
necessarily yield the structure of S’s own atti- 
tudes or the actual structure of attitudes in 
others, but it does yield information about 
the ways in which people perceive attitudes 
“out there” in others. Two groups were used 
in the present experiment, which were thought 
to be diverse with respect to the attitudes 
involved. Any differences in the attitude con- 
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figurations obtained from these groups might 
possibly be traced not only to differences in 
perception but also to differences in the Ss’ 
actual attitudes which had been projected into 
the situation. Since the primary concern of the 
present study was with the perception of atti- 
tudes, the directions and the task were so for- 
mulated as to minimize the effects of projection. 
The success of this intention and the projective 
aspects of the task as formulated will be con- 
sidered in a later section. 


METHOD 


In the present study a multidimensional 
method of successive intervals (1, 6) was applied 
to a set of attitude statements in an attempt 
to discover the number and the nature of the 
relevant dimensions perceived. The attitude 
Statements were chosen from three of Thur- 
stone's scales—attitude toward war, attitude 
toward capital punishment, and attitude toward 
the treatment of criminals. The procedure was 
applied using two diverse groups in an attempt 
to discover any differences which may exist in 
the structuring of these perceived relationships. 
The groups chosen for this purpose were a theo- 
logical seminary class and an Air Force officer 
candidate class. 


The Stimulus Statements and 
Their Experimental Presentation 


Seven statements were selected from each of 
the three Thurstone scales on attitude toward 
war (9), attitude toward the treatment of crim- 
inals (17), and attitude toward capital pun- 
ishment (8). These statements, which were 
selected in such a way as to be fairly evenly 
distributed over the favorable-unfavorable con- 
tinuum for each of the three attitudes, are 
listed below. 


l. On the whole, wars do the world some 
good. 

2. Peace and war are both essential to 
Progress, 

3. War is sometimes necessary because right 
is more important than peace. 

4. War has some benefits, but it's a big price 
to pay for them. 

5. Although war is terrible, it has some value. 

6. War is hardly necessary in the modern 
world. 

7. There is no conceivable justification for 
war, 


8. All criminals are victims of circumstances 
and deserve to be helped. 

9. One way to deter men from crime is to 
make them suffer. 

10. Correction is more effective than punish- 
ment in preventing crime. 

11. Harsh imprisonment merely embitters a 
criminal. 

12. Criminals must be punished. 

13. Petty offenders can be reformed without 
punishment. 

14. Failure to punish the criminal encour- 
ages crime. 

15. The death penalty should be given for 
major crimes. 

16. Capital punishment is good because it 
deters others from crime. 

17. I think capital punishment is necessary 
but I wish it were not. 

18. Until we find a more civilized way to 
prevent crime we must have capital punish- 
ment. 

19. No thinking individual can believe in 
capital punishment as a method of preventing 
crime. 

20. Criminals are pathological people who 
should be corrected, not executed. 

21. The death of a comrade in prison em- 
bitters all the inmates against the state. 


In this procedure, it is not at all necessary 
to use previously scaled attitude statements. 
Any other set of statements, scaled or other- 
wise, which was thought to be of possible rele- 
vance to the domain in question could have 
been used. The selection of previously used 
items for the present investigation was for the 
purpose of gaining some incidental information 
about the claimed unidimensionality of the 
Thurstone scales. 

The statements were arranged in booklet 
form in order to make the task more amenable 
to group presentation. One statement was 
printed at the top of each page, followed by a 
list of statements at the left. To the right of 
each of these statements appeared a series of 
11 boxes, with the first box labeled "Strongly 
Disagree,” the sixth box labeled “Neutral,” 
and the eleventh box labeled “Strongly Agree.” 
The S was asked to imagine the type of person 
who would strongly agree with the statement 
at the top of the page and then to decide how 
this same person would also feel about each 
of the statements on the left. The $ was then 
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to indicate the extent of this person's agree- 
ment or disagreement with each statement by 
placing a cross in one of the boxes to the 
right. In constructing the booklet the order of 
appearance of the statements on each page and 
the order of the pages in the booklet was deter- 
mined by random number techniques. 


The Judgments Required 

The multidimensional method of successive 
intervals requires that the subject arrange all 
possible pairs [n(n — 1)/2 = 210] of the 21 
statements on a distance continuum according 
to the degree of relatedness or agreement 
shown between the members of each pair. In 
order to obtain judgments about perceived 
attitudes, the question of relatedness was set up 
in terms of the attitudes of some other person 
as described above. In terms of the multidimen- 
sional model this task can be considered as 
asking $ to estimate the distance between State- 
ments A and B, while located at A and looking 
at B. The reverse judgment for each pair of 
statements was also obtained in the present 
experiment, ie. S was also asked to imagine 
the type of person who would strongly agree 
with Statement B and then to decide how this 
same person would also feel about Statement A. 
This task may be considered as judging the dis- 
tance from Statement B to Statement A, while 
located at B and looking at A. Thus, the pro- 
cedure requires two judgments for each of the 
210 pairs of statements, making a total of 420 
judgments required from each subject. 

The two judgments of the same distance 
taken from opposite directions were obtained 
in the present experiment in order to see 
whether the data met one of the basic require- 
ments of the multidimensional scaling model. 
The multidimensional attitude scaling model 
is an interpoint-distance model, ie. attitude 
Statements and people are represented in the 
same space by points. The extent of the indi- 
vidual’s agreement or disagreement with the 
statements in the space is represented by the 
distance between the person and the statement. 
The person agrees with items close to him in 
any direction and disagrees with items far 
away. One property of this model, then, is that 
the probability of an affirmative response to 
any statement decreases as the distance between 
the statement and the person increases, with- 
out regard to direction. Statements selected by 
Thurstone scaling methods such as equal- 
appearing intervals, paired comparisons, or suc- 


cessive intervals usually have this property. 
Such statements are commonly referred to as 
“point” statements, and the multidimensional 
scaling model can be properly applied only to 
such items. The requirement that the proba- 
bility of an affirmative response should vary 
inversely with distance independent of direc 
tion is another way of saying that the distance 
between two points in Euclidean space should 
be the same in both directions. 

In the present experiment, then, an attempt 
was made to see whether judgments of the 
same distance taken from opposite directions 
would come out similar. The two judgments 
elicited for each pair of statements were con- 
sidered as independent estimates, and scale 
values were obtained for both the distances 
AB and BA for each pair. If the multidimen- 
sional scaling requirement that the distance 
between two points be the same in both direc 
tions is met by the data, the differences be- 
tween corresponding AB and BA distances 
should be small. Systematic groupings of large 
differences would probably indicate the type 
of asymmetric relationships that might reason- 
ably be expected to occur with cumulative 
items (4) as opposed to the symmetric relation- 
ships expected from point items. If a few large 
differences occur, then, the statements involved 
in these differences should be deleted from the 
study on the grounds of a failure to meet cer- 
tain requirements of the model. However, great 
care should be exercised in evaluating these 
differences, since a few large discrepancies 
might occur by chance alone. Also, large differ- 
ences would be expected for statement pairs at 
either extreme of the similarity scale, since such 
scale values are based on only a few cases and, 
hence, are very unstable. 


The Subjects 


The two diverse groups selected for this ex- 
periment were a seminary class and an Air 
Force officer candidate class. The seminary 
group consisted of 40 third-year male students 
at the Princeton "Theological Seminary in 
Princeton, New Jersey. A regularly scheduled 
class hour had been made available for the 
purposes of this experiment, and the enlisting 
of Ss was put on a purely voluntary basis. Forty 
volunteers were obtained from the class, and 
the rest of the students were permitted to leave 
the room. 

The Air Force group consisted of 82 male 
officer candidates who had just finished the 
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first half of their O.C.S. training at Lackland 
Air Force Base, San Antonio, Texas. 

No direct measure of the actual attitudes of 
these two groups was obtained in the present 
investigation for several reasons. Instead, atti- 
tude differences were inferred from the widely 
different backgrounds of the Ss. It seemed rea- 
sonable that factors which had led to such 
divergent career choices might also be reflected 
in divergent attitudes, especially in an attitude 
area dealing with war and punishment. How- 
ever, the main reason why these actual attitudes 
were not "measured" in the present study was 
that it would have made it difficult, if not im- 
possible, to obtain the cooperation of suitable 
populations of Ss, but other reasons did exist. 

An investigation of the actual attitudes of 
these groups would have to be set up in multi- 
dimensional terms, not only to be comparable 
to the analysis of perceived attitudes, but also 
because some important differences between 
groups are probably reflected in relationships 
among dimensions rather than in responses to 
single scales. On the surface, then, the proper 
approach would appear to be to obtain re- 
sponses to the three Thurstone scales, intercor- 
relate items, and factor-analyze. However, the 
vector model of factor analysis and the inter- 
point distance model of multidimensional scal- 
ing are not directly comparable (6) so the 
actual and perceived attitude structures could 
not be compared anyway. 


RESULTS! 


Each attitude booklet contained responses 
to 210 AB statement combinations and to 210 
BA combinations. Each response, which was 
signified by a cross in one of a series of 11 
boxes, indicated the judged extent of the imag- 
inary other person's agreement with Statement 
B. This set of responses, 420 per booklet, con- 
Stituted the raw data, which were analyzed 
Separately for the seminary and Air Force 
groups, 


The Seminary Group Data 

The 420 combinations of statements were 
scaled by a graphical successive-intervals tech- 
nique developed by Gertrude W. Diederich 
and described in (6). The 210 scale values for 
the AB statement-pairs represented the 210 


1 The data, along with a complete description of 
analytical procedures, appear in the writer's dis- 
sertation (6). 


possible distances among the 21 statements, 
all distances being estimated in one specified 
direction. The scale values for the 210 BA 
statement pairs represented the same distances, 
estimated in the opposite direction. There was 
adequate agreement between corresponding 
AB and BA values, so the two estimates were 
averaged for each distance. The general multi- 
dimensional scaling procedure described by 
Messick and Abelson (7) was then used to ob- 
tain a multidimensional configuration for these 
interstatement distances. 

The end product of the multidimensional 
scaling procedure is a matrix of projections, 
which is analogous to the factor matrix of 
multiple-factor analysis. The rank of this 
matrix is equal to the dimensionality, r, of the 
Euclidean space defined by the experimentally 
obtained distances, and the elements of the 
matrix represent the projections of the state- 
ments on a set of r orthogonal axes placed at 
the centroid of the statements (13, 18). In the 
present experiment, this matrix of projections 
was of Rank 2, and when the two-dimensional 
configuration was plotted, two distinct streaks 
of points were evident. The structure was trans- 
lated slightly in order to place the origin at 
the intersection of the streaks and then was 
rotated orthogonally until the largest streak of 
points was in a horizontal position. This par- 
ticular orientation has the advantage of pro- 
ducing the maximum number of zero projec- 
tions for this two-dimensional space, thus 
permitting a simple psychological interpreta- 
tion. The rotated, translated configuration 
obtained from the seminary group appears in 
Figure 5. 

The horizontal dimension in this figure con- 
tains all 14 statements from the scales of atti- 
tude toward the treatment of criminals and 
attitude toward capital punishment, and thus 
it represents a “propunishment vs. antipunish- 
ment” dimension. Statements 9, 12, 14, 15, 16, 
and to a slightly lesser extent 17 and 18 define 
the propunishment end of the continuum, and 
Statements 8, 10, 11, 13, 19, 20, and 21 define 
the antipunishment end. A “war” dimension 
is located obliquely to the “punishment” dimen- 
sion at an angle of approximately 60°, with 
Statements 1, 2, 3, 4, and 5 lying in the prowar 
direction and Statements 6 and 7 in the antiwar 
direction. The 60° angle between the dimen- 
sions indicates a positive correlation between 
the prowar and propunishment attitudes per- 
ceived by the seminary group. 
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Angle = 60° 30° 


Figure 5. Dimensional configuration of 21 attitude statements for the theological 


seminary group. 


The Air Force Group Data 


The graphical scaling technique used to 
analyze the seminary data is a simple, quick 
iterative routine which yields a very good 
approximation to a computationally laborious 
least-squares solution for successive intervals, 
which was also developed by Diederich (see 6). 
After the seminary data had already been 
analyzed graphically, an IBM routine for the 
computational procedure was developed, so 
the Air Force data were treated according to 
the. punched-card technique (15). When the 
420 scale values obtained in this way were 
examined, it was again found that there was 
adequate agreement between corresponding 
AB and BA values, so the two estimates were 
averaged for each distance. The resulting 210 
averages represented all possible distances 


among the 21 statements, The general multi- 
dimensional scaling technique (7) was used to 
obtain the configuration of the statements, and 
the matrix of projections was again found to 
be of Rank 2. 

Two distinct streaks of points were again 
evident in the two-dimensional configuration, 
and the structure was translated slightly and 
rotated orthogonally to the same orientation 
as for the seminary data. This rotated, trans- 
lated configuration for the Air Force group 
appears in Figure 6. The horizontal factor in 
this figure represents a punishment dimension, 
having Statements 9, 12, 14, 15, 16, 17, and 18 
in the propunishment direction and Statements 
8, 10, 11, 18, 19, 20, and 21 in the antipunish- 
ment direction. A war dimension is located 
oblique to the punishment dimension at an 
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Angle — 68* 


Figure 6. Dimensional configuration of 21 attitude statements for the Air Force 


officer candidate group. 


angle of approximately 68*, with Statements 
1, 2, 3, 4, and 5 lying in the prowar direction 
and Statements 6 and 7 in the antiwar direc- 
tion. The 68° angle between these dimensions 
indicates a positive correlation between per- 
ceived prowar and propunishment attitudes 
for the Air Force group. 

The two configurations described above can 
be considered to be adequate dimensional 
representations of the experimentally obtained 
distances, since, apart from small residuals, 
only two dimensions were necessary to account 
for relationships among 21 stimuli, and no 
indications of non-Euclidean properties were 
found, ie, there were no markedly negative 
latent roots. 


A Comparison of the Structures Obtained from 
the Seminary and Air Force Groups 

A comparison between the final rotated 
structures obtained from the seminary and Air 
Force groups (Figure 5 vs. Figure 6) indicates 
an extremely high amount of agreement. It is 
quite evident from these figures that the two 
structures are essentially identical. The angle 
between the two dimensions in the seminary 
structure is 60*, while in the Air Force struc- 
ture the angle is 68*. This shift in axis position 
is in an expected direction in terms of the 
probable differences between the "actual" atti- 
tudes of the Air Force and seminary groups. 
One might expect a theological seminary popu- 
lation to be more "humanitarian" than a 
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military population. If this is the case, the 
“actual” attitudes of the seminary group would 
be expected to exhibit a higher correlation 
between antiwar and antipunishment attitudes 
than the "actual" attitudes of the Air Force 
group. If the “actual” attitudes of the Ss in 
this particular experimental situation affect 
attitutes as perceived in others, then the shift 
in axes toward a higher correlation for the 
seminary than for the Air Force group is in 
an expected direction. However, the angular 
difference of 8° is probably not significant. 


DISCUSSION 


The dimensional configurations obtained 
from both the seminary and Air Force groups 
certainly demonstrate that perceived relation- 
ships among attitudes toward war, capital 
punishment, and the treatment of criminals 
can be adequately represented in dimensional 
terms. Apparently, then, individuals do think 
of these attitudes as having a definite structure, 
and when asked to make judgments concerning 
attitude statements, they respond in terms of 
such dimensional frames of reference. The con- 
figuration obtained from both the Air Force 
and seminary groups is essentially the same as 
Type B given in Figure 2, with the war dimen- 
sion drawn obliquely to the punishment di- 
mension. Within this frame of reference, the 
two groups find it possible to conceive of 
people in each of the four quadrants, but it 
should be noted that certain kinds of people 
would be almost inconceivable to them. For 
instance, both groups would have difficulty in 
conceiving of a person who was in favor of 
capital punishment of extreme criminals and 
at the same time against punishing minor 
criminals. 

The finding of an underlying structure in an 
attitude domain, when considered along with 
some of the specific properties of the structure, 
has important implications for attitude mea- 
surement and theory. Since one of the more 
reasonable ways to measure multidimensional 
attributes is to apply appropriate unidimen- 
sional scales along each of the dimensions 
involved, dimensional configurations such as 
those found in the present experiment would 
aid considerably, first of all, in determining the 
number and the nature of the relevant dimen- 
sions and, secondly, in selecting and evaluating 
the appropriate unidimensional scales. It is 
striking that the attitude statements in both 


the Air Force and seminary structures form 
straight lines, indicating a unidimensionality 
in the original Thurstone scales, at least for 
the items considered in the present experiment. 
Similar studies investigating the linearity of 
other scales would be interesting and very im- 
portant in justifying the use of these scales in 
attitude measurement. Another important as- 
pect of these configurations is the fact that the 
scales on attitude toward capital punishment 
and attitude toward the treatment of criminals, 
which are admittedly similar in content but 
nevertheless had been independently estab- 
lished by equal-appearing interval techniques, 
should be perceived as precisely the same uni- 
dimensional continuum. This finding has im- 
portant theoretical implications, since it sug- 
gests that different attitude variables, which 
had been separately abstracted on a priori 
grounds, can be reasonably thought of in terms 
of a single dimension. These results, however, 
pertain only to "perceived" attitudes and, no 
matter how suggestive they are, the structure 
of "actual" attitudes should be investigated 
before such a finding is incorporated into atti- 
tude theory. It well may be, for instance, that 
the "actual" attitudes of the Ss in the pres 
ent experiment were three-dimensional, even 
though the groups perceived the attitudes in 
others in terms of a two-dimensional con- 
figuration. In other words, the results of the 
present study are immediately applicable to 
considerations of the ways in which individuals 
conceive of attitudes as being structured "out 
there," but further research is necessary before 
they may be applied to considerations of indi- 
viduals’ own attitudes. In any event, multi- 
dimensional analysis applied in attempts at à 
systematic understanding of the psychology of 
attitudes offers the hope that the host of trait 
names and labels in the attitude area can be 
represented in terms of a limited number of 
variables. 

The high correlation between the configura- 
tions obtained from the Air Force and seminary 
groups indicates a definite similarity in the 
ways in which these groups perceive attitudes 
in this particular domain as being structured. 
It is not unreasonable that perceived attitude 
structures should be similar for two diverse 
groups, since similar frames of reference are 
important for efficient communication, but 
the essential identity of the two structures 
requires comment. 

Previous studies (5, 14, 16) have shown à 
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persistent correlation between a person’s own 
attitudes and his estimates of the attitudes of 
others. If the seminary and Air Force groups 
differ, as they probably do, with respect to 
attitudes toward war and punishment, the 
seemingly projective nature of the task of esti- 
mating the opinions of others would lead one 
to expect a difference between the structures 
obtained in the multidimensional analysis. 
However, the above studies are not really com- 
parable to the present experiment, since they 
involved estimates of group opinion obtained 
from subjects who were themselves members 
of the group. Also, as Campbell (2, p. 23) has 
pointed out, “. .. in a group of any size, the 
respondent has an uneven acquaintanceship, 
and probably associates more with those of 
tastes like his own. Basing his estimates of 
group opinion upon his own experience in the 
group, his error may be in part a ‘sampling 
bias’ as well as biased perception.” 

The information supplied the subject in the 
present experiment was such as to be favorable 
to objective perception and unfavorable to 
projection. In one sense the information sup- 
plied was minimal; the other person, whose 
attitude relationships were to be estimated, 
was characterized only by complete agreement 
with a single statement. In the absence of a 
detailed characterization of the other person, 
the S evidently ascribed to his attitudes some 
kind of modal or average structure. It might 
' be argued that minimal information is a favor- 
able condition for eliciting projective responses, 
but the information supplied in the present 
case, although minimal in characterizing the 
other person, was quite specific concerning iso- 
lated aspects of his attitudes. A great deal of 
Specific information is given to the subject 
when he is told that another person strongly 
agrees with the statement, “There is no con- 
ceivable justification for war," and it is not 
difficult to evaluate objectively the extent of 
this person's agreement with the statement, 
"On the whole, wars do the world some good." 
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17. A Technique and a Model for Multi-Dimensional 


Attitude Scaling 


ROBERT P. ABELSON 


The majority of techniques for scaling atti- 
tudinal judgments or responses produce scales 
which are one-dimensional. The scale consists 
of a single continuum of opinion along which 
are located a succession of opinion items (or 
of people. This continuum generally ranges 
between pro- and anti- the attitudinal object. 
Examples of such one-dimensional techniques 
are Thurstone scales) Likert scales? paired 
comparisons scales? successive intervals scales,* 
and Guttman scales.5 It is apparent that in all 
of these methods the property of one-dimen- 
sionality has been forced on the scale by the 
choice of items and by the analytical method 
and by no means necessarily represents the 
dimensionality of the set of all commonly held 
attitudes in the domain. It is certainly true 
that in many attitude areas (religion, for ex- 
ample), different individuals may be favorable 
(or unfavorable) in different ways, on different 
levels, or for different reasons. For each of 
these scaling methods there exist procedures? 


a Reprinted from Public Opinion Quarterly, 
Winter, 1954-1955, 405-418, with permission of the 
author and the publisher. This study was con- 
ducted at Yale University as part of a coordinated 
program of research on attitude and opinion 
change, financed by a grant from the Rockefeller 
Foundation, 
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1 Thurstone, L. L., and E. J. Chave, The Measure- 
i of Attitude. Chicago: Univ. of Chicago Press, 
? Likert, A., "A Technique for the Measurement 
of Attitudes," Arch, Psychol., 1982, No. 140. 

3 Thurstone, L. L, “Psychophysical Analysis,” 
Amer. J. Psychol, 1927, 38, 268-389. 

* Saffir, M., “A Comparative Study of Scales Con- 
structed by Three Psychophysical Methods,” Psy- 
chometrika, 1937, 2, 179-198. 

5 Guttman, L., “The Cornell Technique for Scale 
and Intensity Analysis," Educ. psychol. Measmt., 
1947, 7, 247-279. 

* Green, B. F., "Attitude Measurement,” in G. Lind- 


to determine the internal consistency of "scala- 
bility" of the final scale. If the scale is inter- 
nally consistent, it is one-dimensional. But if 
it is not internally consistent, then items must 
be rewritten or discarded or the scoring manip- 
ulated in some fashion. The methods do not 
provide for multi-dimensionality of the mate- 
rial as an alternative to one-dimensionality, 
and this is an unfortunate limitation on the 
power of scaling methods. Lazarsfeld's latent 
class analysis? is non-dimensional, since it re- 
sults in the specification of a number of dis- 
crete nominal classes of people which have no 
necessary relation to each other along one or 
more psychological continua. Coombs, mean- 
while, has proposed ranking methods which 
are intended to yield multi-dimensional or- 
dinal scales, but there is considerable doubt 
as to the practicality of the methods. He 
cannot handle fallible data as yet. Anything 
but perfectly reliable judgments prevent the 
occurrence of a solution. Factor analysis has 
been used gingerly? but is subject to some 
reservations.1° Thus, in the study of attitudes, 
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there is a dearth of satisfactory methods which 
permit a multi-dimensional analysis. 

In psychophysics, however, multi-dimensional 
scaling methods have recently come into prom- 
inence (Richardson, Attneave, Torgerson).11 
These methods scale physical stimuli in a multi- 
dimensional psychological space. The funda- 
mental concept involved is that of psycho- 
logical distance. There is nothing to prevent 
the transference of multi-dimensional method 
from the psychophysical to the attitude domain. 
Social distance is a well-known social-psycho- 
logical phrase.12 Psychological distance is a 
concept employed in approach and avoidance 
gradient theory!? and in Lewin's field theory.14 
We think we know what we mean by psycho- 
logical distance. If psychological distances can 
be analyzed as though they were physical dis- 
tances, it will be possible to obtain a “map” 
of the way in which an individual structures 
the similarities and differences among attitudes 
in a given domain. On such a map, short 
(psychological) distances would represent (psy- 
chological) similarity, or agreement, and long 
distances would represent dissimilarity, or dis- 
agreement. Multi-dimensional scales based upon 
. the interpretation of dissimilarities or disa- 
greements as distances have already been 
constructed with non-psychophysical stimuli 
(Klingberg,15 and Osgood and Suci16). 

There is a theorem due to Young and House- 
holder which permits a multi-dimensional map 
to be constructed from a set of interpoint dis- 
tances dj, between each point j and every other 
point k.17 Torgerson has given procedures for 


Opinion Measurement," in S. A. Stouffer, et al., 
Measurement and Prediction, Princeton: Princeton 
Univ. Press, 1950. 

11 Richardson, M. W., “Multidimensional Psycho- 
physics," Psychol. BulL, 1938, 35, 659-660. (Ab- 
stract). Attneave, F., "Dimensions of Similarity,” 
Amer. J. Psychol., 1950, 63, 516-556. 

12 Bogardus, E. S., "Measuring Social Distances,” 
J. Appl. Sociol., 1925, 9, 299-308. 

13 Miner, N., "Comments on Theoretical Models 
Illustrated by the Development of a Theory of 
Conflict," J. Personal., 1951, 20, 82-100. 

14 Lewin, K., Principles of Topological Psychology, 
New York: McGraw-Hill, 1936. 

15 Klingberg, F. L., "Studies in Measurement of the 
Relations Among Sovereign States," Psychometrika, 
1941, 6, 335-352. 

16 Osgood, C. E., and Suci, G. J., "A Measure of 
Relation Determined by Both Mean Difference and 
Profile Information," Psychol. Bull., 1952, 49, 251- 
262. 


17 Young, G., and A. S. Householder, “Discussion 


obtaining the coordinates of n stimuli from 
the inter-stimulus distances when the data from 
which the distances are computed is not com- 
pletely reliable.18 The dimensionality of the 
space into which the stimuli are scaled is not 
fixed in advance; it is determined by the data, 
although an attempt is made to limit the di- 
mensionality if the data is willing. 


ANALYTICAL TECHNIQUE 


Scaling Methods for Obtaining 
Inter-Stimulus Distances 

N subjects judge the subjective dissimilarity 
of objects j and k on an m-point scale. A judg- 
ment of 1 indicates maximal similarity, a judg- 
ment of m indicates maximal dissimilarity, and 
the other integers cover the range in between. 

It is desired to find S;,, the scale value of the 
dissimilarity or distance between j and k. With 
n stimuli there are 14 n(n — 1) judgments to 
be made by each subject. 

To construct a scale for one subject, take 
the judgment directly and call it $5. This pro- 
cedure is bound to be insensitive and not 
highly reliable; however, precedent for treating 
integer judgments as though they formed a 
true interval scale (i.e., where the interval from 
1 to 2 is equal to the interval from 2 to 3, etc.) 
is provided by Stephenson and by Osgood and 
Suci. Maps for single individuals were actually 
computed in connection with the experimental 
study reported in this paper. They are some- 
what unstable due to the unreliability of single 
judgments, but are by no means unreasonable. 

If the number of subjects is between 1 and 
25 or thereabouts, equal-appearing intervals 


-scaling can be used; that is, the median judg- 


ment of the N subjects for a stimulus pair is 
taken as S,,. If, in computing the medians, the 
upper boundary of the first category is assigned 
the value 1 and the upper boundary of the 
mth category is assigned the value m, the scale 
values will have a possible range from à to 
m — à. 

With as many as 25 subjects (or, if they are 
available, many more than that) the method of 
successive intervals is a superior way to find the 
scale values from the m-category judgments. 
Variations of the method of successive intervals 


of a Set of Points in Terms of Their Mutual Dis- 
tances,” Psychometrika, 1938, 3, 19-22. 

18 ‘Torgerson, W. S., “Multidimensional Scaling: I. 
Theory and Method,” Psychometrika, 1952, 17, 401- 
419. 
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have been offered by Saffir, Mosier, Edwards, 
Thurstone and Edwards, Gulliksen,? and in 
slightly different form, by Attneave and Garner 
and Hake.2° Of these treatments, that of 
Thurstone and Edwards is perhaps the most 
readable, while that of Gulliksen is the most 
precise. 

In both the equal-appearing and successive 
intervals methods, the scale values are deter- 
mined save for an arbitrary constant. It is 
blandly assumed that zero is where it ought to 
be on the scale; that is, that a true psycho- 
logical distance of zero would actually be rep- 
resented as such by the scaling method. For 
the equal-appearing intervals method the range 
of scale values is from à to (m — 3). But with 
the successive intervals method, it is possible to 
get negative scale values by certain methods. 
If this happens it is imperative to add a con- 
stant to all values in such a way that the 
smallest scale value is some small arbitrarily 
chosen positive number. With this precaution, 
it is usually safe to set Sir» the scale value, equal 
to dj, the distance. A more detailed discussion 
of the additive constant problem is presented 
elsewhere.?! The entire multi-dimensional scal- 
ing procedure using successive intervals analysis 
is presented in great detail by Messick.?? 


EXPERIMENTAL STUDY 
The Stimuli 


Twelve statements relating to war, arma- 
ments and communism were used in the study. 
The statements are listed below. 


19 Mosier, C. L, “A Modification of the Method of 
Successive Intervals,” Psychometrika, 1940, 5, 101- 
107; Edwards, A. L., “The Scaling of Stimuli by the 
Method of Successive Intervals,” J. Appl. Psychol., 
1952, 36, 118-122; Thurstone, L. L., and Edwards, 
A. L, “An Internal Consistency Check for the 
Method of Successive Intervals,” Psychometrika, 
1952, 17, 169-180; Gulliksen, H., “A Least-Squares 
Solution for Successive Intervals Assuming Unequal 
MN Deviations," Psychometrika, 1954, 19, 117- 
20 Garner, W. R., and H. W. Hake, “The Amount 
of Information in Absolute Judgments,” P. 'sychol. 
Rev., 1951, 58, 446-459; Attneave, F. A., “Method of 
Graded Dichotomies for the Scaling of Judgments,” 
Psychol. Rev., 1949, 56, 334-340. 

21 Messick, S, and Abelson, R. P., "The Additive 
Constant Problem in Multidimensional Scaling,” 
Psychometrika, 1956, 21, 1-16. 

x Messick, S., “The Perception of Attitude Rela- 
tionships; A Multi-Dimensional Scaling Approach 
to the Structuring of Social Attitudes,” Ph.D. The- 
Sis, Princeton Univ., 1954. 


Abbreviation 
1, A very tough American 
policy toward Russia is a 
good idea, even if it leads 
to war. (Tough) 


2. We should not fight Russia 
unless she commits direct 
aggression with her armed 
forces. 

3. War is the only means of 
ridding the world of Com- 
munism. 

4. Concessions of territory to 
the Communists would be 
desirable if this prevented 
war. (Concessions) 

5. There are many effective 


(Fight only) 


(Rid) 


means to fight Communism (Effective 
without war. means) 
6. War is a necessary and use- 
ful instrument. (War) 
7. Peaceful relations with Rus- 
sia should be sought by 
bargaining across the con- 
ference table. (Bargaining) 


8. Communism is a greater 
force for good in the world 
than we have been led to (Pro- 


believe. Communism) 
9. America should increase her 
present armaments enor- 
mously. (Arm) 
10. War is tragic and terrible. (Peace) 
11. America should reduce her 
present armaments substan- 
tially. (Disarm) 
12. Communism is a false, dan- (Anti- 
gerous ideology. Communism) 


The key statements in the study are numbers 
6, 8, 9, 10, 11, and 12. These provide the end- 
points for three anchoring dimensions—a Com- 
munism dimension, a War dimension, and an 
Armaments dimension. The other statements 
are felt to be anchored in these dimensions. 
A further hypothesis is that the Armaments 
dimension is a function of the War dimension 
and the Communism dimension. That is to say, 
armaments are instrumental to the value of 
Anti-Communism and to either the value of 
War or the value of Peace (depending on how 
one looks at it), and thus do not lead to an 
independent third dimension. It is probable 
that the dimensionality of an attitude map is 
equal to the number of independent value 
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TABLE 1 
D Il. 
Pro-Armament lo ol Anti-Communist lo Anti-Communist 
| | | 
| | Pro-Armament | Pro-Disarmament 
Pro-War | | Anti-War O-------- | -------- o 
o——— --0 o-------- | 2------- o 
| Pro-War Anti-War 


Pro-Disarmament lo ol Pro-Communist 


III. 

Anti-Communist 

Cre Spee Ak COP TERM UR E 
Pb Armament. 9, erm mmm 
Pro-War fos eU A LL t 


| 
| 
| 
| 
| 


o Pro-Communist 


Pro-Communist 


o Pro-Armament 
o Anti-War 


(The circles indicate the statements and the dotted lines hypothetical dimensions along which state- 
ments intermediate between the extremes would lie.) 


systems covered by the attitude statements used, 
with statements instrumental to those values 
not increasing the dimensionality. (In the pres- 
ent study, it turns out that two factors are 
indeed sufficient.) Considering only the six 
anchoring stimuli, one might hypothesize three 
major typologies of final attitude map. These 
types are sketched in Table 1. 

Type I might be called the Popular Con- 
ception: Armament is equated with Anti- 
Communism; Armament is unrelated to (at 
right angles to) a Pro-War attitude. 

Type II would correspond to a Quaker 
philosophy: Armament is equated with being 
Pro-War, and Disarmament with being Anti- 
War; Armament is unrelated to Anti- 
Communism. 

The armament dimension might lie at an 
intermediate angle between those depicted by 
Types I and II. Armaments could be seen as 
being both Anti-Communist and Pro-War, and 
as such, might constitute a conflicted atti- 
tudinal object. 

Type III represents one solution to this pos- 
sible dilemma—it is the solution provided by 
a preventive war: Anti-Communists are seen 
to be Pro-War, and Armaments are directly 
instrumental to both. 

In order to decide whether these hypothe- 
sized typologies actually can be found in vari- 
ous samples of respondents, judgments were 
elicited from a group of one hundred Yale 


students (presumably representing a popular 
political position) and from a sampling in the 
New York area—fourteen students who are 
affiliated with the Socialist Party. (The political 
position of the Socialist Party is Anti-Commu- 
nist but pacifistic.) 


Method 


Each subject was given the following instruc- 
tions: 


"For each pair of statements, say A and B, 
imagine two people having a discussion in 
which one of the two people makes statement 
A and the other person makes statement B. 
Make a judgment of the extent to which you 
feel that these two people are in agreement. 
Use an eight-point scale. A rating of 1 indicates 
maximum agreement, a rating of 8 indicates 
extreme disagreement, and 2, 8, 4, 5, 6, and 7 
indicate intermediary degrees of agreement 
between these two extremes.” 


The subjects were also asked to state their 
own agreement or disagreement with each of 
the twelve statements on an eight-point scale. 
In addition to the similarity judgments and 
the personal agreements, thirteen political and 
social attitude items were administered. From 
the twenty-eight subjects scoring most politi- 
cally conservatively were selected two random 
samples of fourteen subjects each to compare 
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with the Socialist sample of fourteen subjects. 
A two-dimensional map was computed for each 
of these three groups. 


RESULTS: I. COMPARISON OF DIFFERENT 
GROUPS 


For each group, the set of median judgments 
of psychological similarity were manipulated 
according to Torgerson’s procedure to produce 
the two-dimensional map. For purposes of 
comparability of the three groups, each map 
was translated and rotated in such a way that 
the war dimension lay along the x-axis with 
its mid-point at the origin. 

"The maps for the Socialist group and one of 
the two Conservative groups are presented in 
Figure 1. 

Several interesting conclusions may tenta- 
tively be drawn from the mappings, although 
they do not conform very closely to the pat- 
terns hypothesized beforehand. 


Anti-Communism 


Ridding 
3 
o 5War 


-3 


-2 


Figure Ia. 


l. The maps are reliable. The two Con- 
servative groups are more like each other than 
either is like the Socialists. (Thus the map for 
the second Conservative group has been 
omitted.) 

2. The Socialists view Pro-War and Anti- 
Communist statements as being correlated. 
The angle between these two dimensions is 
42*. For the two conservative groups, this angle 
is 61* and 67*, respectively, representing con- 
siderably less correlation. 

3. The Armaments dimension is more closely 
associated with the War dimension for the 
Conservative groups than for the Socialist 
group. 

4. The Socialists view statements 2 and 5 
differently than do the Conservatives. State- 
ment 2 is: "We should fight Russia only if 
attacked directly by her armed forces." State- 
ment 5 is: “There are many effective means to 
fight Communism without War." The Socialists 
see these statements as farther removed from 


Fight only 
2 


o 
Effective means 
5 


o 


Bargaining 
7 


o 
10 
2 Peace 3 


Concessions 
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ll 
Disarm 
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Two-dimensional map of twelve statements (rotated and translated)— 
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Anti-Communism 
12 


-3 


-2 
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Peace 


Bargaining 


Concessions 


Pro-Communism 


Figure Ib. Two-dimensional map of twelve statements (rotated and translated )— 


Conservatives I. 


a simple Anti-War statement than do the 
Conservatives. 

5. If the variance of the loadings in the 
Pro-War—Anti-War direction and the variance 
of the loadings in the direction orthogonal to 
this are calculated, the results are as shown in 
Table 2. 

For the Socialists, a greater proportion of the 
variance is due to considerations associated 
with War and Peace. It might be inferred from 
this that War vs. Peace is proportionately more 
important to the Socialists than to the Con- 


TABLE 2 


servatives. For the Socialists, the statements 
which contribute variance independent of this 
are chiefly No. 2 and No. 5. For the Con- 
servatives, the independent variance is contrib- 
uted mainly by the Pro- and Anti-Communist 
statements. 


Results: Contour Maps 


In the attitude maps discussed above, no 
mention was made of the groups’ own personal 
agreements or disagreements with each single 
statement. It is possible to incorporate this 


———————— —— P" 


Variance Due 


to War 

Dimension 
Socialists 18.08 
Conservatives I 15.32 
Conservatives II 15.12 


Variance 
Independent of 
War Dimension 


Proportion 
Due to 
War Dimension 


4.83 0.789 
5.57 0.734 
5.42 0.736 


MULTI-DIMENSIONAL ATTITUDE SCALING 153 


information into the map directly. Each state- 
ment in the space can be assigned a “valence.” 
Then the valence (with a sign and a magnitude) 
of any arbitrary point P(x, y) in the space can 
be computed from the formula 


NS YO) 
"= Da EET li 


where V(P) is the valence at the point P, V(j) 
is the valence of statement j, and dp, is the dis- 
tance from statement j to point P. The denomi- 
nator is written (1 + d2p;) so that the valences do 
not tend to infinity as the point P moves close to 
a statement j. In the present form, the valence 
at such a point will be the valence of j plus the 
individual contributions from the more distant 
points. This formulation assumes that there 
can be a discrepancy between the valence of a 
statement and the valence of the point in the 
psychological space which it occupies. Equation 
(1) is analogous to the equation for the elec- 
trical potential at a point in a field of electrical 
charges, except that (1 + d?) is used in place of 
simply d. Valence is analogous to potential.?% 

Carrying the analogy further, one can draw 
contours of equal valence in the attitude space. 
Of particular interest is the contour of zero 
valence, since it represents the locus of all po- 
tential statements toward which the subjects 
are neutral. 

In the present study the valence of each state- 
ment for each experimental group was chosen 
as the median agreeableness M; of the state- 
ment. Since a median of 4.0 presumably repre- 
sents indifference (ratings of agreeableness were 
made on an eight-point scale), the formula em- 
ployed was 

V(j) = 4.0 — M, (2) 
Contour maps were constructed from Equations 
(1) and (2), aided by graphical interpolation. 
The maps for two of the three groups are pre- 
sented in Figure 2. These are the same maps as 
in Figure 1, with contour lines added. 

In reading these contour maps, a "hill" (a 
concentration of contours of high valence) is to 
be interpreted as a region in the attitude do- 
main within which all statements or potential 
Statements are highly regarded by the group. 


23 This model is reminiscent of Lewin's "life space" 
in that we have a cognitive map with valences oper- 
ative within its boundaries. The map is a map of 
cognitions and not of the physical world, however. 
We have attempted here to build more quantative 
detail into such a map. 


The valleys (concentrations of contours of low 
valence), are regions wherein all statements or 
potential statements are highly disfavored by 
the group. Detailed consideration of the pos- 
sible significance of these contour maps leads to 
several sorts of propositions, of which four are 
presented here: 


l. The steepness of the hills and valleys (the 
degree of bunching together of the contours) is 
an indicator of the tension in a region. Where 
the contour of zero valence lies in the middle 
of a region of high tension, strong conflict 
exists. (It is not possible to tell on the basis of 
the contour map alone whether such conflict is 
among the members of the group or applies 
intrapersonally to all members of the group.) 

2. When two or more widely separated high 
hills exist, the domain can be said to be unin- 
tegrated. Vacillation in the attention of the 
group from the region of one hill to the region 
of the other hill or hills is likely. 

3. If there is a large discrepancy between 
the valence of a statement and the valence of 
the surrounding region, particularly if the dis- 
crepancy involves valences of different signs, 
the situation is unstable; there is a predisposi- 
lion toward change of the domain. That is, 
local tension (as defined in Proposition 1) exists. 
The change could come about through a change 
in valence of the statement or a change in its 
position, or else through compensatory changes 
in the valences and positions of nearby state- 
ments. This proposition suggests some of the 
ways in which multi-dimensional scaling might 
prove useful in attitude-change research. 

4. The contour of zero valence, in general, 
intersects the line drawn between the opposite 
poles of any value dimension (eg, War vs. 
Peace) The point of intersection divides the 
range of alternatives along that value dimen- 
sion into those which the group approves and 
those which the group disapproves. 


In applying these propositions to empirical 
contour maps, it is necessary to realize that a 
great many assumptions are involved in the 
construction and interpretation of these maps. 
The most doubtful assumptions are these: 


(a) All attitude statements in the domain 
with strong valences have been represented 
among the list of statements given to the sub- 
jects for judgment. (The inclusion of important 
new statements might seriously alter the con- 
tours.) 
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Figure 2a. Contour map—Socialists. 


(b) The statements in the domain are related 
to each other in the spontaneous thinking of 
the subjects and are not merely an aggregate of 
items thought to be logically related by the 
experimenter and represented to the subjects as 
a domain. (Thought processes of the “logic- 
tight compartment" type would not. allow for 
the action of valences at a distance, though the 
present model assumes such action.) 


Further qualifications which are probably less 
serious than (a) or (b) are these: 


(c) Sampling errors and judgmental variabil- 


ity are involved in the original similarity judg- 
ments and estimates of valence. 

(d) The valence estimates are based on de- 
gree of agreement with the statements: no ac- 
count is taken of the intensity of agreement. 

(e) There is no guarantee that Euclidean 
geometry is appropriate to attitude scaling. 

(f) The analytical procedure is only an ap- 
proximation; while two dimensions may be 
generally adequate for the statements on the 
average, two or three statements may require a 
third dimension. It is furthermore possible that 
something analogous to the specificity of tests 
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in factor analysis should have been postulated. 
Each statement would then lie partially in a 
dimension of its own. 

(g) Equation (1) might not be the best 
choice. 


In spite of all these limitations, the maps at 
hand seem to be reasonably interpretable on 
the basis of the above propositions. 

On the basis of Proposition 1, one infers that 
the Conservative groups are in greatest conflict 
Over the statements concerning Peace, Bargain- 
ing, and Disarmament. The Socialists, on the 
other hand, show little or no conflict; only the 
Statement concerning Communism lies on the 
Opposite side of the zero valence contour near 


+0.4 
-29 


a region of high tension. There are, neverthe- 
les, three regions of very high tension sur- 
rounding the statements respectively concerning 
War, Armament, and Peace. 

Proposition 2 suggests that the region of posi- 
tive valence is much more highly integrated for 
the Socialists than for the Conservatives. How- 
ever, notice that in the regions of negative 
valence, neither of the groups is completely 
integrated. It would be nice to be able to make 
the statement: “The Socialists know quite well 
what they are for, but are not of one mind as 
to what they are against, whereas the Conserva- 
tives know very well what they are against, but 
are not sure what they are for.” This statement 
is almost, but not quite justified by the maps. 
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The attitude statements which are unstably 
located according to the terms of Proposition 3 
are Nos. 3, 7, and 9 for the Conservatives I, 
Nos. 7 and 11 for the Conservatives II, and 
Nos. 4, 8, and 12 for the Socialists. The valences 
of all of these statements are substantially dis- 
crepant from the valences of the localities in 
which they lie. 

Proposition 4 leads to very interesting results. 
Considering first the War vs. Peace value di- 
mension, we find that the Conservative groups 
will accept all positions varying along this di- 
mension except those that are very close to War. 
The Socialists, however, will only accept those 
that cover the half of the continuum nearest 
to Peace. On the Arm-Disarm dimension, the 
Conservatives will accept anything except a sub- 
stantial reduction of armaments, whereas the 
Socialists highly favor a substantial reduction of 
armaments and would presumably accept at 
most a very slight increase of armaments. On 
the Anti-Communist through Pro-Communist 
dimension, the Conservatives approve of all po- 
sitions on the Anti-Communist end and also of 
a small segment past the mid-point. On the 
other hand, the Socialists disapprove of posi- 
tions at both extreme ends, favoring only a 
more neutral set of positions nearer the Pro- 
Communist end (although it must be remem- 
bered that the Anti-Communist statement is 
unstably located by the Socialists.) 


SUMMARY 


A mathematical technique for obtaining 
multi-dimensional attitude maps is discussed. It 
can be used with any number of subjects. The 
attitude space is to be thought of as a frame of 
reference in which various attitudes are em- 
bedded because of their meaningful connections 
with the values which anchor the frame of ref- 
erence. The subjects are requested to judge the 
similarity of the members of pairs of attitude 
statements. High similarity or meaningful con- 
nection can be represented by small psycholog- 
ical distance and high dissimilarity by large 
psychological distance. The mathematical anal- 
ysis converts sets of distances into a psychological 
space with each statement occupying a certain 
coordinate position. 

The technique was applied to forty-two sub- 
jects. Twelve statements dealing with war, ar- 
maments and Communism were employed. 
Consistent differences were found between the 
attitude map of a group of Socialist subjects 
and the maps of two groups of politically con- 
servative subjects. There is a discussion of some 
of the interesting insights into the attitudes of 
the subjects provided by the maps. These can 
be uncovered by postulating a kind of Lewin- 
ian force field model in which the various state- 
ments exert attractive and repulsive forces 
throughout the space. 


18. Systematic Differences Between Individuals in 


Perceptual Judgments 


LEDYARD R. TUCKER 


OPTIMAL DECISIONS AND DIFFERENCES IN 
HUMAN JUDGMENTS 


Human judgments and differences between 
individuals in making these judgments appear 
to enter the process of making optimal deci- 
sions in two major areas of consideration. In 
the first area there is the problem of the mea- 
sure to be optimized. This may be a function 
of values and preferences of individuals. The 
second area involves the nature of operating 
systems and the laws of relations in these sys- 
tems. Differences between individuals which 
exist in the perception and understanding of 
such systems influence the perception and in- 
terpretation of information available to the per- 
son about the present state of a system, about 
the dynamic changes and movements taking 
place within a system, and in the relations 
between a system and its surround (or environ- 
ment). Further aspects of these differences be- 
tween individuals are in anticipated effects that 
may result from possible actions. Such judg- 
ments, undoubtedly, affect the optimality of 
subsequent situations, 

The major emphasis in this chapter is on 
methods for studying the structure of individual 
differences in judgments. Consider two logical 
extremes regarding individual differences. In 
one we might assert that every individual is 
like every other individual with perhaps some 
random effects in his judgmental process. In the 
other we could contend that every individual is 
Unrelated to every other individual in his judg- 
mental process. The first is a neat and tidy 
view—too neat and tidy. The second is chaos. 


* Excerpted from an article in M. W. Shelly and 
G. L. Bryan (Eds), Human Judgments and Opti- 
mality, John Wiley and Sons, New York, 1964, with 
Permission of the author and the publisher. 


It offers no promise for solution to problems 
of human judgments. Truth about the relations 
in human judgments among individuals in a 
population probably lies somewhere between 
these extremes. Undoubtedly the judgmental 
processes of individuals lie in some structured 
field, more complex than all being alike and 
not so diverse as to be chaotic. Knowledge of 
the structure of individual differences in this 
field should be important to the use of human 
judgments in optimal decisions. 


Factor Analytic Procedure for Study of 
Differences in Perceptual Judgments 

Several recent studies have used an adaption 
of factor analysis to study the structure of indi- 
vidual differences in perceptual structure. Fac- 
tor analysis is a technique originally devised to 
study the structure of individual differences in 
intelligence and abilities. Possible adaptations 
have been explored for application in other 
areas; the present use is one of the latest. 

Consider the mathematical outline in Figure 
1. The observed data is indicated in the table 
in the upper section. For the present use, pairs 
of stimuli are considered from a list of stimuli 
and a measure of the perceived difference be- 
tween the stimuli in each pair is obtained for 
each individual. There is a column for each in- 
dividual. For example, in a study by Helm and 
Tucker (1962), “Individual Differences in the 
Structure of Color-Perception,” Helm obtained 
measures of the relative perceived differences by 
Torgerson's multidimensional-triad ratio-scaling 
procedure between pairs of colors for 10 colors 
painted on porcelain disks. In multidimensional 
scaling such measures are considered as distances 
between points in a space representing the 
structure of perception. In the procedure used 
interpoint distances were obtained for each in- 
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dividual and were recorded in a table (sce Fig- 
ure 1). There was a row for each pair of colors 
and a column for each individual. 

Once a table of interpoint distances is com- 
piled, a question can be raised about the simi- 
larity or differences of entries in the several 
calumns. If all individuals were perceiving 
the differences between stimuli alike, each col- 
umn would be like every other column. In 
matrix algebra terminology this matrix of inter- 
point distances would be of rank 1. A random 
effect would merely add random discrepancies. 
If there were two kinds of people with separate 
perceptual systems, the columns in the table of 


interpoint distances could be sorted so that 
those in one section would be alike and those 
in a second section would be alike, but the 
columns in the two sections would differ be- 
tween sections. This would be a special case 
of a rank 2 matrix. For example, in the study 
by Helm and Tucker there were 10 individuals 
with normal color vision, indicated by standard 
color vision tests, and four who were color- 
blind. We expected that possibly the 10 with 
normal color vision would form one group and 
the four color-blind individuals would form a 
second group. This expectation was borne out 
only in part. The color-blind individuals did 
tend to form a separate group, but those with 
normal color vision did not do so. There was 
considerable nonrandom variation in the results 
among those with normal color vision so that 
the columns of observations formed a progres- 
sion from one type to a second. The matrix for 
normal color vision is a more general example 
of a matrix of rank 2. 

The method of analysis used does not depend 
on finding separate groups or orders among the 
columns of the table of interpoint distances. 
It is a procedure related to principal axes factor 
analysis and is based on a development by Carl 
Eckart and Gale Young (1936) for the approxi- 
mation of one matrix by another of lower rank. 
The tables of factor coefficients and the approx- 
imation formula in Figure 1 are laid out for a 
rank 2 approximation. A table of factor coeffi- 
cients for stimulus pairs appears in the middle 
section of Figure 1, and a table for individuals 
appears in the bottom section. According to 
the approximation formula at the bottom of the 
figure, the coefficient for a stimulus pair on fac- 
tor I is multiplied by the factor coefficient for 
an individual on factor I. A similar product of 
coefficients for factor II is formed, and the sum 
of these two products is obtained. This yields 
the approximation to the interpoint distance 
for the selected stimulus pair and individual. 
For a rank 1 approximation there would be 
only one column of factor coefficients for the 
stimulus pairs, one row of factor coefficients for 
the individuals, and one term or product in 
the approximation formula. For a rank 3 ap- 
proximation there would be three columns for 
the stimulus pairs, three rows for the individ- 
uals, and three products in the approximation 
formula. In this method of analysis an approx- 
imation of each rank is tried, and the sum of 
squares of errors of approximation is deter- 
mined. A higher ordered approximation is used 
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if it results in a somewhat lower sum of squares 
of discrepancies than an approximation of rank 
one less. One of the unsolved problems is the 
development of a satisfactory decision formula 
for determining the rank of approximation to 
be used. In the color discrimination data three 
factors appeared to be needed to account for 
the facts observed. The approximation ac- 
counted for 98% of the sum of squares of the 
original data, and the sum of squares of discrep- 
ancies accounted for the remaining 2%. 

A convenient geometric analogy interprets 
each row of the table of factor coefficients for 
individuals as giving the coordinates of points 
for the individuals on one dimension. Thus, if 
there are two rows in this table, each individual 
may be represented as a point in a two-dimen- 
sional space. Consider Figure 2. There is a sim- 


d 


Increase in intermediate 
differences 
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Decrease in intermediate 
differences 


Figure 2. Factor structure for individuals. Color 
discrimination data. 


ilar figure for individuals of normal color vision 
in the Helm and Tucker study. When the data 
for all subjects were analyzed, three dimensions 
were needed. However, the three-dimensional 
space could be viewed so that one dimension 
had only color-blind individuals with nonzero 
coordinates. Figure 2 presents the remaining 
two dimensions. All individuals have nearly 
equal positive coordinates on axis I, the vertical 
axis. The coordinates on axis II, the horizontal 
axis, range from a sizable positive value to 
smaller negative values. Points for the observed 
individuals are the blackened circles. The open 
circles represent hypothesized individuals and 
are discussed shortly. If there had been two 
groups of individuals, the observed points would 
have appeared in two corresponding clusters. 
The progression from one type of perception of 
stimulus pair differences, or interpoint dis- 
tances, corresponds to a series of observed 


points running from one side to the other, as 
in Figure 2. Both configurations can be repre- 
sented in a two-dimensional space which corre- 
sponds to a rank 2 approximation of the inter- 
point distances. A common terminology is to 
equate the rank of the approximation to the 
dimensionality of the space and speak of a two- 
dimensional approximation, a three-dimensional 
approximation, etc. 


IDEALIZED INDIVIDUALS: DEFINITION AND 
DESCRIPTION 


Having found that it is necessary to use two 
dimensions to represent the judged interpoint 
distances between the colors by normal-visioned 
subjects, we are faced with an interesting and 
important question: what is the nature of these 
differences in perception? To answer this ques- 
tion, the hypothesized individuals indicated by 
open circles in Figure 2 were chosen to repre- 
sent the progression of perceptions by the nor- 
mal-visioned individuals. These hypothesized 
individuals are frequently termed “idealized in- 
dividuals.” For each idealized individual the 
coordinates in the space formed a new column 
of factor coefficients for individuals, and inter- 
point distances for stimulus pairs were com- 
puted by the approximation formula in Figure 
1 by using the experimentally determined factor 
coefficients for stimulus pairs. If the interpoint 
distances for a given idealized individual have 
been determined, the methods of multidimen- 
sional scaling may be applied to gage the na- 
ture of the corresponding perceptual space. In 
this space the stimuli are represented by points 
at distances from one another equal to the ob- 
served interpoint distances. For example, if the 
distance from stimulus A to B is one unit, from 
stimulus B to C, four units, and from stimulus 
A to C, five units, these interpoint distances 
can be represented by placing all three stimuli 
on a straight line, since the distance from A to 
C is equal to the sum of the distances from A 
to B and from B to C. If the three interpoint 
distances were three, four, and five units, the 
stimuli would be at the corners of a right tri- 
angle. For more stimuli it may be necessary to 
use more than two dimensions in the percep- 
tual space to represent the observed interpoint 
distances. 


Idealized Individuals in Color Discrimination 


For the color-discrimination data there was a 
major two-dimensional perceptual space for 
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each of the idealized individuals, with the 10 
colors forming a more or less circular pattern 
as expected from the selection of colors, the 
stimuli having been selected from one plane of 
the color pyramid to form a circle of the differ- 
ent hues. The most perfect fit to the interpoint 
distances by this two-dimensional pattern was 
found for idealized individual C. Discrepancies 
in interpoint distances, as approximated by the 
circular perceptual space for the other idealized 
individuals, produced an interesting progres- 
sion. Idealized individual E corresponded to an 
overstatement of the intermediate-sized inter- 
point distances. For example, the interpoint dis- 
tance between a red and a yellow was large for 
idealized individual E when compared with the 
distance between the red and a green. For ide- 
alized individual A the reverse tendency was 
indicated; the intermediate-sized interpoint dis- 
tances were understated. 

Implications of the individual differences 
among individuals of normal color vision for 
optimal decisions is worthy of further study. It 
might be found that people like idealized indi- 
vidual E could make sharper matches of new 
colors to standard colors or to colors of existing 
objects. People like idealized individual A might 
be better able to pick out complementary colors. 
A possible consequence is that different indi- 
viduals would make better optimal decisions 
based on judgments of colors dependent on the 
task. 


A STUDY OF PERCEPTION OF POLITICAL 
THINKING 


In a second study to be reviewed Tucker and 
Messick (1963) used judgments of perceived 
similarity of political thinking of prominent 
people, such as Roosevelt, Dewey, Talmadge, 
McCarthy, Douglas, Hitler, Nehru, and Stalin. 
There were 20 men on the list. Each person 
chosen as a subject was asked to judge the 
amount of similarity-dissimilarity in political 
thinking for each pair of men. These ratings 
were used directly as interpoint distances. A 
total of 39 subjects, all undergraduates at the 
University of Illinois, took part in this study 
and were selected from a larger group who 
made the ratings and answered a questionnaire 
about their political views. In choosing the sub- 
jects, an attempt was made to obtain a wide 
variety of political views. The analysis of the 
individual differences in the ratings yielded a 
three-dimensional space in which the coordi- 
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Figure 3. Political relations data. Interpretation 
for idealized individuals. A. One dimension: United 
States men versus foreign men. B. Two dimensions: 
(1) Republican-Democrat; (2) good-bad. C. Possibly 
seven dimensions of rather complex structure. Ab- 
stracted from Ledyard R. Tucker and Samuel Mes- 
sick (1963). 


nates on the first dimension bore nearly equal 
positive values. Figure 3 presents the plot be- 
tween the second and third dimensions. An 
interesting feature of this plot is the approxi- 
mate triangular dispersion of the points for 
which there is no apparent explanation. Three 
idealized individuals were selected, as indicated 
in Figure 3, A near the heaviest cluster of ob- 
served individuals, B toward the upper left, 
and C toward the lower left. The only trend 
related to the political views of the subjects is 
that a larger portion of the liberal democrats 
favoring labor are closer to idealized individual 
B than the other idealized individuals. 

The perceptual spaces for the three idealized 
individuals offer interesting comparisons. For 
idealized individual A there was only one di- 
mension in the perceptual space, with the men 
from the United States clustered at one point 
and the foreign men at another. One must con- 
clude that many of the undergraduate students 
made only this rather crude distinction between 
the political figures used in the study. Idealized 
individual B discriminated between Republicans 
and Democrats and made a second discrimina- 
tion that seemed to be based on an evaluative 
judgment. This second dimension of the per- 
ceptual space separated Joseph Stalin, Adolf 
Hitler, Alger Hiss, Henry Wallace, and Senator 
McCarthy from the rest. It must be remembered 
that this view is shared by a number of liberal 
Democrats, which may account for the inclu- 
sion of McCarthy in this dimension. The per- 
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ceptual space for idealized individual C is 
rather complex and involves rather particular- 
ized distinctions of a larger number of dimen- 
sions. It may be that the direction in the space 
of individuals in Figure 3 away from the A-B 
line toward idealized individual C represents 
overstatements of intermediate differences which 
make many of the stimuli appear further apart 
and increase the dimensionality of the percep- 
tual space. 


A STUDY OF PERCEPTION OF RELATIONS 
AMONG PERSONALITY TRAITS 


Darhl Pedersen (1962) studied for his PhD 
dissertation the perceived personality trait rela- 
tionships of two samples of 50 undergraduate 
students plus 160 others who answered the ques- 
tionnaires. The central questionnaires listed a 
sample of 600 stimulus pairs drawn from the 
1225 pairs generated from 50 trait names. Rat- 
ings were on a scale from opposite through 
unrelated to same. Factor coefficients for individ- 
uals determined for each sample of 50 subjects 


on half the pairs judged were extended to esti- 
mates of the coefficients for all 260 subjects in 
the study. This permitted an evaluation of the 
reliability of the results determined by using 
different subjects and different trait pairs in the 
factor analysis. For three major factors the re- 
liability correlations were more than .80. Figure 
4 presents the plot between dimensions II and 
III for all subjects. As in the preceding studies, 
the points were all positive on dimension I. 
Again, as in the political relations data, three 
idealized individuals were chosen, and the na- 
ture of the perceptual space was investigated. 
The space for idealized individual A, for which 
the coordinate on dimension I was high, ap- 
peared to be the standard stereotype for rela- 
tions between the traits. Idealized individual B 
had a high coordinate on dimension II and 
represented a distortion of the stereotype 
wherein many items, for which the stereotyp- 
ical view was that the adjectives in each item 
were unrelated, the distortion was to judge the 
adjectives more opposite in meaning. We might 
speculate about the relations between this di- 


Factor structure of individuals 


Figure 4. Trait relations data. Interpretation for idealized individuals. A. Stan- 
dard stereotype of trait relations. B. Response set increasing dissimilarity between 
related trait pairs. C. Authoritarian point of view. Abstracted from Darhl M. 


Pedersen, 1962. 
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mension and the dimension differentiating peo- 
ple with normal color vision and the dimension 
toward idealized individual C in the political 
relations study. All three dimensions appear to 
represent transformation of the intermediate 
relations. 

Dimension III in Pedersen's study is especially 
interesting. There are indications that it repre- 
sents the view of people with authoritarian per- 
sonalities. Pedersen obtained a number of 
measures on the subjects in his study, and corre- 
lations were calculated between these measures 
and the individual coefficients on the three di- 
mensions of individual differences. Correlations 
with all outside variables for dimensions I and 
II were quite low. However, dimension III cor- 
related negatively with ability measures and 
scientific interest (in the Kuder Preference Rec- 
ord) and positively with content scores on au- 
thoritarianism, as measured by the California 
F Scale and derived questionnaires. Shifts in the 
perceptual space involved the traits character- 
ized by the following adjectives: dishonest, 
weak, passive, submissive, rational, predictable, 
aggressive, domineering, defensive, and unin- 
teresting. Important among these shifts is that 
domineering became positively related to hon- 
est, stable, and rational and negatively related 
to others, such as tense, selfish, and insensitive. 
Being rational was less like being intelligent 
and more like being unpredictable, domineer- 
ing, active, interesting, and brave. Being aggres- 
sive shifted from being similar to selfish, tense, 
and unsociable toward being similar to unsel- 
fish, honest, mature, and sociable. Defensive de- 
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C. Alternative Measurement Techniques 


19. The Indirect Assessment of Social Attitudes 


DONALD T. CAMPBELL 


In the problem of assessing social attitudes, 
there is a very real need for instruments which 
do not destroy the natural form of the attitude 
in the process of describing it. There are also 
situations in which one would like to assess 
"prejudice" without making respondents self- 
conscious or aware of the intent of the study. 
At the present time there are few if any indirect 
tests which could confidently be used for either 
of these purposes. There are, none the less, a 
considerable number of techniques that have 
been partially explored and validated. It is the 
purpose of this paper to survey such techniques 
and to present a point of view with regard to 
the problem of indirect measurement. 

Current interest on the part of social psychol- 
ogists in the indirect assessment of attitudes is 
perhaps primarily an aspect of the larger pro- 
jective test movement in personality study. 
However, as will be seen in the course of this 
survey, there has been an interest in this ap- 
proach from the very first efforts in attitude 
measurement (81), anticipating by some ten 
years the current interest in “projective tech- 
niques.” 

The terms “indirect” and “projective” have 
been used to refer to both disguised, and to 
non-structured measurement efforts. Using these 
two terms alone, one could distinguish four 
types of tests: 


l. Non-disguised-structured: the classic direct 


* Reprinted from Psychological Bulletin, 1950, 47, 
15-38, with permission of the author and the Amer- 
ican Psychological Association. 


attitude tests of Thurstone (78), Likert (49), et 
al. 

2. Non-disguised-non-structured: the free-re- 
sponse interview and questionnaire approaches, 
the biographical and essay studies. 

3. Disguised-non-structured: the typical “pro- 
jective” techniques. 

4. Disguised-structured: tests which approxi- 
mate the objective testing of attitudes. 


It is with the latter two categories that the pres- 
ent review is concerned, although some items 
that properly belong in category 2 will be in- 
cluded where they represent deliberate efforts 
at "projective" attitude testing. 

While a formal division of content on these 
two criteria suffices, the writer will, in discus- 
sion, use a third which overlaps but does not 
duplicate the other two. "This third criterion 
is that of dependence upon voluntary self- 
description as opposed to diagnosis based upon 
differential performance in an objective task. 
Upon this latter criterion rather than "struc- 
turedness” will rest the primary distinction be- 
tween our two types of methods. 


DISGUISED-NON-STRUCTURED TESTS OF 
SOCIAL ATTITUDES 


In this category are included those techniques 
which offer the respondent opportunity for the 
spontaneous expression of attitudes in an am- 
biguous or non-structured setting. Most of these 
techniques are borrowed quite directly from 
well established clinical tools, and will be clas- 
sified on that basis. 
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Approaches Based on the 
Thematic Apperception Test 


Though by no means the first, the indirect 
test most widely cited in previous surveys of the 
literature (18, 42, 54, 82) is that of Proshansky 
(65). He intermingled ambiguous pictures of 
labor situations with the more usual T.A.T. 
scenes. The pictures were presented to a group 
by means of slides, with each person being asked 
to write for two and one-half minutes on what 
he thought the slide represented. Slides were 
shown for only five seconds. Proshansky found 
that ratings made from the resulting descrip- 
tions correlated .77 and .67 with a direct verbal 
scale of attitudes toward labor. A more elab- 
orate proposal for the use of a similar technique 
with attitudes toward Negroes and Jews has 
been made (51), but as yet, no results are avail- 
able. 

Specially designed Thematic Apperception 
pictures have also been used by Frenkel-Bruns- 
wik, Sanford and Levinson in their extensive 
study of the personality correlates of prejudice 
(24). In this research the purpose was not so 
much to measure prejudice as to get a more 
detailed and qualitative picture of its expres- 
sion. Nevertheless, the complicated interrela- 
tionships they found qualify the use of such 
pictures as attitude measuring instruments. For 
example, many prejudiced women told warmer 
and more sympathetic stories to a picture of a 
Negro mammy than they did to pictures of an 
elderly white woman (23). While such a finding 
is consistent with personality theory, it points 
to the danger of an over-simplified one-to-one 
interpretation of such material. 

Essentially identical in technique is the “Hu- 
man Relations Test" developed by the staff of 
the College Study in Intergroup Relations (15). 
This consists of ten drawings, each portraying 
an ambiguous intergroup contact. For example, 
à crowded street-car scene and a basketball 
scene are used. The respondents are asked to 
write a short story about each scene, telling 
what happened, how it came about, and what 
will happen next. The test is available on 
printed cards or projection slides, and since it 
is being used by a number of cooperating col- 
leges, and at a wide variety of age levels, there 
is some promise of a standardized test emerging. 

Johnson (39) has successfully studied the de- 
velopment of Anglo-Spanish attitudes in the 
Southwest through stories told by children to a 
specially designed series of pictures. He used six 


carefully selected conflict situations. These were 
duplicated in three forms: with all Anglo char- 
acters (for use with English-American children), 
with all Spanish characters (for use with Spanish 
children), and with mixed Anglo-Spanish char- 
acters (for use with both groups). Attitudes were 
assessed by contrasting responses to identical 
situations when they involved Anglo-Spanish 
conflict and when depicted by their own group 
members only. Quantification was achieved by 
having two judges categorize individual re- 
sponses on a number of dimensions. Reliability 
coefficients for six sub-groups were over .90. 


Approaches Utilizing Doll Play Techniques 


In the first explicit attempt to use a projec- 
tive technique in the assessment of social atti- 
tudes, Dubin (19) utilized toys in a fashion 
similar to the "play techniques" of the clinical 
psychologist. Using ten adult respondents, he 
asked them to "construct on this table a dra- 
matic scene or scenes of the world as you see it 
today" and later "Now make a dramatic scene 
or scenes of the world as you would like it to 
be." From report and pictures of these scenes, 
three judges were able to estimate answers on 
21 direct attitude questions dealing with labor, 
the Negro, internationalism, etc. with an aver- 
age rank order correlation of .49. 

For use with children, Arnold Meier has de- 
veloped an interviewing aid, the “What Would 
You Do?" test, which is “projective” in the pres- 
ent sense. Doll cut-outs including minority chil- 
dren are manipulated on background drawings 
which depict such situations as entering the 
home, school scenes, etc. (15). 

Similar to these approaches is the “movie 
story game” developed by Evans and Chein (21) 
for use with children from eight to twelve. On 
a miniature stage, Negro and white dolls are 
manipulated, with the child being asked at var- 
ious points what the identified doll would say. 
Particular attention is paid to the patterning 
of segregation responses with acquaintances and 
with strangers. Preliminary studies indicate ef- 
fective disguise and general meaningfulness for 
the test. 

Hartley and Schwartz (33) have combined 
doll-play with pictorial material in the investi- 
gation of intergroup attitudes of children five 
to seven years of age. Montage background 
compositions carried characteristic symbols of 
Jewish religion in one case, Catholic in another, 
and in a third indicated typical middle class sur- 
roundings with no indication of religion. Iden- 


THE INDIRECT ASSESSMENT OF SOCIAL ATTITUDES 165 


tical family sets of dolls were placed upon these 
three backgrounds, and all were available for 
use by the child in playing out situations, such 
as a birthday party, school bus, meals, etc. In 
the preliminary report it is indicated that chil- 
dren identify with some accuracy the religious 
symbols, and that their play indicates inter- 
group attitudes in a meaningful way. 


Sentence Completion Tests 


At the Ohio State University, Shirley Wilcox 
Brown is using a modification of Rotter's test 
(68) in investigating attitudes toward the Negro. 
A 40-item schedule has been prepared, in which 
are imbedded some 20 sentence fragments deal- 
ing specifically with the Negro problem. Exam- 
ples of relevant and neutral items are as follows: 


I feel. <5 

Skin color . . . 

I hate... 

Maybe... 

Some lynchings . . . 

The KEKAYI 

It seems to me that segregation . . . 
Negro body odor . . . 

Racial intermarriage . . . 


mE SOP mo orf p 


oc 


In this test, no real effort at dissembling is 
made. The cooperating respondent is asked to 
volunteer a description of his own attitudes, 
with no more disguise than in the direct tests. 
The instructions read “Complete these sen- 
tences to express your real feelings. Try to do 
every one. Be sure to make a complete sen- 
tence!” Preliminary applications of the test in- 
dicate its power to discriminate between criteria 
groups. Scoring has been done by both coding 
and rating procedures. The indications are that 
the test can provide a reliable measure of atti- 
tudes, 

Another twist to the sentence completion no- 
tion is being used in the study of personality 
and prejudice among school children conducted 
by Else Frenkel-Brunswik and Harold E. Jones 
(22). Here the test avoids all mention of minor- 
ity groups, but provides stereotypic statements 
which may be completed with names of various 
minority groups or others. Examples of items 
are: 


Are there some people who are mean? 
WHAT PEOPLE? 

It would be better if more of a certain type 
of people were allowed to come into the United 
States. WHAT PEOPLE? 


Some people are poor and it is their own 
fault. WHAT PEOPLE? 


This test has elicited mention of foreign and 
minority groups from about one third of the 
children to which it has been administered. 
From another third or so come one or more 
anti-prejudice statements. A portion of the stu- 
dents make no responses classifiable in either 
way, and are thus not effectively evaluated. 
Using a net score (subtracting anti-prejudice 
responses from the total of prejudiced ones) 
corrected reliability figures run around .6 to .8 
and correlations with a highly reliable direct 
test are on the order of .5. The approach is 
most satisfactory for the comparison of groups 
of respondents, and for the evaluation of the 
relative salience and. extremity of attitudes to- 
ward different minority groups. In addition, 
unique data on the uniformity of stereotyping 
are provided. 

While these two sentence completion tests 
make a deliberate effort to achieve a projective 
or indirect character, they are not too different 
from any free-response questionnaire on atti- 
tudes. Compare, for example, Zeligs' approach in 
which school children were asked to "write the 
most interesting true sentence" they knew about 
each group, within a one minute limit (84). 


Miscellaneous Non-Structured Techniques 


The Rosenzweig Picture Frustration Test has 
also been modified for the measurement of so- 
cial attitudes, a task to which it is obviously 
appropriate. This test requires the respondent 
to fill in the balloons in a series of cartoon 
drawings involving frustrating face-to-face inter- 
group contacts (6). 

Probably belonging within the limits of our 
general category is the study by Fromme (25). 
He presented to the respondents five political 
cartoons, each with four alternative captions, 
covering a wide range of pro and con opinion. 
The respondent was asked to pick the best cap- 
tion, and this choice, plus the discussion result- 
ing, was utilized in a qualitative analysis of 
attitude structure. 

The procedures just discussed are all indirect 
and “projective” in current usage of the terms. 
At the risk of some oversimplification, the assets 
they have in common may be emphasized and 
the stage set for contrasting them with the sec- 
ond group of procedures. 

While tests such as these may be in part dis- 
guised, in that the experimenter does not tell 
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the respondent his real purpose and may indeed 
substitute a false justification, their primary 
asset is that of securing an expression of atti- 
tudes in a more natural and spontaneous form, 
allowing opportunity for each individual to 
"project" upon a neutral screen his own inte- 
gration of the problem. In contrast, the usual 
direct paper-and-pencil attitude test, requiring 
the endorsement of prepared items, may be said 
to force artificially the expression of attitudes 
into a preconceived and common mold. The 
great advantage of these non-structured tests is 
one of freedom, rather than of disguise. While 
some respondents may complete their assign- 
ments unaware of the experimenter's interest, 
in a tense situation one could hardly use any 
of the above devices expecting to get uncon- 
scious or uncensored expressions of attitude 
from uncooperative respondents. Essentially 
these tests are "voluntary" in the same sense 
that the usual attitude test, interest inventory, 
or neuroticism test is. The respondent is told 
(either directly or in effect) that there are no 
right or wrong answers, and he is placed in a 
situation in which a voluntary and arbitrary 
performance is acceptable. 


DISGUISED-STRUCTURED TESTS OF SOCIAL 
ATTITUDES 


The approaches which we will consider below 
differ from the ones mentioned above perhaps 
only in degree or in relative emphasis. Yet the 
distinction involved is important. The charac- 
teristics of these disguised, non-voluntary tests, 
can be stated in a number of ways: 


The respondent participates in an objective 
task, in which he seeks right answers. The vol- 
untarism of the usual projective techniques is 
lacking. To the respondent, the situation is 
similar to that of an achievement or ability 
test. 

All respondents have a common motivation 
in taking the test. All, we may assume, are 
seeking to perform adequately on the same ob- 
jective task. Attention is focused on a common 
goal, oblique to the experimenter's purpose. 

Rather than stressing the freedom and lack 
of structuring, there is an attempt to diagnose 
attitudes from systematic bias in the perform- 
ance of an objective task. The test may be 
highly structured—directly scorable—and still 
offer opportunity for unconsciously operating 
bias to distort behavior in a systematic and di- 
agnosable manner. 


Here is a simple formula for constructing 
such a test. Find a task which all your respon- 
dents will take as objective, and in which all 
will strive to do well. Make the task sufficiently 
difficult, or use a content area in which the re- 
spondents have had little experience or oppor- 
tunity for reality testing. Load the test with 
content relative to the attitude you study. Look 
among the responses for systematic error, or for 
any persistent selectivity of performance. If 
such be found, it seems an adequate basis for 
the inference of an attitude. 

To be sure, this approach has to some extent 
been used in the projective techniques discussed 
above. The T.A.T., for example, is introduced 
as a “test of creative imagination” (58), but 
little attempt is made to carry out effectively 
this pretense pattern. Respondents accept the 
task with a wide diversity of motivations, vary- 
ing levels of seriousness, curiosity, and suspi- 
ciousness. And while all may be “projecting,” 
the motivational situation is highly un-uniform, 
and meaningful comparisons between the re- 
sponses of the serious, the suspicious, and the 
playful can hardly be made. Similarly, the “play 
technique” has been given to college men with 
the excuse that the experimenter was interested 
in “ideas for moving picture plays” and wanted 
the respondent to construct a “dramatic scene” 
(58, p. 553); and the sentence completion test 
has been given as an ability test (as Ebbinghaus 
intended it), with instructions to complete the 
sentences in any grammatical fashion as quickly 
as possible. But the disguise in these typical 
projective tests seems to be half-hearted, or im- 
plausible, weakened by the explicit freedom of 
response allowed. At best, the respondents ac- 
cept the task as meaningless, or as a psychol- 
ogist’s mystery. It is probably common that 
something of the intent of the experimenter is 
divined, and the task completed despite this. 


Information Tests 


To make the model of the indirect, dis- 
guised, non-voluntary test of social attitudes 
more explicit, let us examine the use of an 
information test for attitude diagnosis. The 
reader has probably commented at one time or 
another upon the inextricable interrelation- 
ship between people's attitudes and what they 
take to be facts. You would all probably agree 
that in a detailed test of information, the direc- 
tion of people's guesses or misconceptions will 
frequently bear a relationship to their attitudes. 
In a complementary fashion, a given person's 
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knowledge is apt to reflect in its unevenness 
his selective awareness and retention, or his 
biased sources of information. Newcomb has 
dramatically portrayed the non-random char- 
acter of right and wrong answers on an infor- 
mation test in his study of “The Effect of 
Social Climate upon Sonie Determinants of 
Information” (62). While selective awareness 
and divergent sources of iriformation were pri- 
marily involved, Newcomb comments with 
regard to the difficult information items “the 
direction of guessing is altogether likely to be 
weighted toward the subject's attitude. If this 
reasoning is correct, the . . . test tends to be- 
come itself an attitude test." Coffin (14) and 
Smith (76) have found beliefs of factual type 
statements to correlate highly with related atti- 
tude tests. Indeed, in looking over the high 
correlations found in the literature on the rela- 
tionship between information and attitudes, 
one is tempted to guess that some re-interpreta- 
tion of them is in order. Is it not likely that 
many items were such that persons of liberal, 
tolerant attitudes found. guessing easier? 

Loeblowitz-Lennard and Riessman have pro- 
posed that an information test be used to 
measure indirectly attitudes toward Negroes 
and Jews (50). Hammond (31) has actually 
used such a test to measure attitudes toward 
“labor-management” and "Russia" and his 
work is worthy of some detailed comment here. 
We have mentioned that not only guessing 
behavior but also differential patterns of infor- 
mation may be diagnostic of attitudes. Ham- 
mond eliminated the latter by the “error- 
choice" technique, in which the respondent 
was forced to choose between the two alterna- 
tive answers each of which was, by intent, 
equally wrong, but in opposite directions from 
the correct answer. Such items were: 


"Average weekly wage of the war worker in 
1945 was (1) $37.00, (2) $57.00." “Financial re- 
ports show that out of every dollar (1) 166, 
(2) 3¢ is profit.” 


Scoring these information guesses as attitude 
items gave total scores on 20 such items that 
differentiated a labor union group from two 
business clubs with almost no overlap. In spite 
of the small number of cases (18 union and 42 
business) the critical ratios were 11.3 on the 
Labor-Management, and 12.5 on the Russia 
questionnaire, Reliabilities on the two scales 
were roughly estimated at .78 and .87 respec- 
tively, 


What we have here is an objective test situ- 
ation (which could be made more so by using 
more alternatives and including a correct one) 
in which people’s errors are not random, but 
systematic. The presence of biased performance 
clearly necessitates the inference of some under- 
lying process, which we choose to call attitude. 
The claim for face validity on such a test seems 
to be stronger than the one that can be made 
for either the direct or unstructured attitude 
test. Is not systematically biased performance 
in dealing with environmental actualities the 
essential practical meaning of attitude? 

Parrish (64) has more recently used a regular 
multiple choice information test as an indirect 
indicator of attitudes, in conjunction with a 
direct attitude scale and a second indirect test 
involving estimating opinion poll results (see 
below). All three tests effectively reflected 12 
minutes worth of anti-Kuomintang propa- 
ganda in approximately the same degree when 
the greater reliability of the direct test is con- 
sidered. The pseudo-information test had 32 
items and a reliability of .66 (using Kuder- 
Richardson formula 14 modified for multiple 
levels of response). It correlated .67 (corrected 
for attenuation) with the direct test (10 items, 
reliability .91) given at the same time (both 
anonymously, so there is no reason to suspect 
the direct test). The correlation with the pub- 
lic opinion estimate test was .59. 

Kremen (43) attempted to evaluate the ef- 
fect of student role-playing of a discrimination 
episode upon attitudes toward the Negro, us- 
ing a multiple-choice information-type indirect 
test and a direct test. While neither test re- 
flected the role playing in mean scores, her 
findings have importance for attitude measure- 
ment, inasmuch as role-playing lowered the 
relationship between the direct and indirect 
test. The results from ten sociology classes 
(five pairs in which the instructor and subject 
were the same) are presented below: 


Correlation between 
Direct and 
Indirect Tests 


39 36 .20 53 .38 
54 .70 35 58 .60 


Role-playing classes: 
Paired control classes: 


For all classes combined, the direct test had a 
reliability of .89, the indirect test .42 (Kuder- 
Richardson, Formula 14). The explanation of 
this phenomenon is not obvious from her data 
insofar as analysed. She does not report relia- 


168 ATTITUDE MEASUREMENT 


bility values separately for the ten classes, 
although this step is planned in further analysis 
of the data. 

R. B. Cattell (12, 18) has undertaken a large 
scale exploration into the "objective" mea- 
surement of attitudes, using a rationale very 
similar to the one presented here for non- 
voluntary attitude tests. In this study, a large 
number of ingenious methods have been tried, 
and others suggested.! The methods have been 
applied to a number of attitude-interests, such 
as playing more sociable games, excelling in 
one's career, and being smartly dressed, which 
differ greatly from the social-problem topics of 
most of the other studies reviewed. Two of the 
methods may be classified with information test 
approaches. His measure of “False Belief 
(Delusion)” (12) was presented to respondents 
as a multiple-choice information test. A sample 
item: “During the war church attendance in- 
creased greatly and, since V-J day it has: 
(declined slightly; tended to increase still more; 
stayed at its high peak; returned to its pre-war 
level; fallen to its lowest point since 1920)." 
Presumably the different responses were given 
different weights as in a direct test and as done 
by Parrish and by Kremen. Applied to a reli- 
gious attitude, ten such items gave a Spearman- 
Brown reliability of .53, and correlated .38 
with a paired-comparisons preference test, and 
10 with records kept of time and money 
expenditure. 

Cattell's other information test approach was 
more straight-forward. As mentioned above, we 
may expect that a person will reveal his atti- 
tudes not only in the direction of his "errors" 
but also in the selective character of the "right 
answers" he knows. 

Cattell prepared 10-item information tests 
for each of a number of attitudes. The items 
dealt with the “knowledge required in follow- 
ing the course of action connected with the 
attitude” (12). For twelve topics, these tests, 
scored in terms of the number of items correct, 
provided Spearman-Brown reliabilities ranging 
from .18 to .92, with an average of .59. Cor- 
relation with records of time expended on the 
various activities averaged but .16. With money 
spent on activities the average value was .15, 
and with paired-comparison preferences, .16. 


! Omitted from the present review are a number 
of physiological and related psychological measures, 
such as P.G.R., writing pressure, pulse pressure, 
speed of reading, fluency, and speed of decision, 
which Cattell also investigated. 


In this approach, Cattell has resurrected an 
approach to the "objective" or "indirect" 
measurement of vocational interests which 
flourished during and after the first world war. 
Fryer (26) and others provide summaries of 
such research. No longer called "interest" tests, 
these selective vocabulary and object recogni- 
tion tests are today firmly established among 
vocational selection and guidance tools. 


Estimation of Group Opinion 
and Social Norms 4 

As Travers (79), Wallen (80), and others 
have demonstrated, there is persistent correla- 
tion between a person's own attitude and his 
estimate of group opinion. While a few per- 
sons may chronically underestimate the popu- 
larity of their own opinions, the prevailing 
tendency is to overestimate the size of the 
group agreeing with one's self. Biserial corre- 
lations between endorsement of a dichotomous 
item and the estimation of the proportion of 
the group endorsing show few values below .2 
and characteristically run from .4 to .6 or 
higher. This relationship seems to indicate the 
possibility of assessing a person's attitudes by 
giving him the entirely objective task of esti- 
mating group opinion. If one regards corre- 
lation with direct attitude expression as vali- 
dation, one starts with very high single item 
validities indeed. If from a single item, one 
can predict a person's attitudes to the extent 
of a correlation of .40, how well might one not 
be able to predict from twenty such items? 
In several attempts along this line at the Ohio 
State University, conducted under the direction 
of the present author, the results have fallen 
considerably below such optimistic hopes. From 
combined estimates of group opinion, the best 
correlation obtained with a direct attitude 
score based on the same items runs around .60, 
not better than the best single item-estimate 
correlation. More detailed exploration of the 
technique is being made, however. Parrish (64) 
used a public opinion estimate test, with items 
dealing with U. S. opinion on China, the 
opinions of Americans with experience in 
China, and the opinions of the Chinese. Only 
the first type are directly comparable to the 
Travers and Wallen situation, and these did 
not come out the best in the item analyses. 
The test, containing only 10 items, had a 
Kuder-Richardson reliability of .53, correlating 
67 with the direct test when corrected for at- 
tenuation. In this instance there was no direct 
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duplication of content in the direct attitude 
test items and in the opinion estimates. 

The technique is not entirely new. In 1929, 
Sweet, using a test designed by Goodwin Wat- 
son, found boys' estimates of group opinions 
valuable in diagnosing adjustment problems 
(77). More recently, Katz has reported results 
confirming the general relationship between 
own attitudes and estimates of group opinion, 
although he does not give correlation coeffi- 
cients (41). As part of Section A in their "Sen- 
timents" examination, Murray and Morgan 
(59) ask the respondent to guess what the ma- 
jority of people believes or prefers, e.g. "what 
are the three most popular things to do?" 
Newcomb (61) made one of the first uses of 
the percentage estimate of group opinion, but 
has not reported his results in terms of the 
responses of individuals. It might be pointed 
out that the correlation involved in these stud- 
ies is not necessarily to be interpreted as pro- 
jection. In a group of any size, the respondent 
has an uneven acquaintanceship, and probably 
associates more with those of tastes like his 
own. Basing his estimates of group opinion 
upon his own experience in the group, his 
error may be in part a "sampling bias" as well 
as biased perception. 

F. C. Bartlett (4, 5) and D. M. Carmichael 
(9) before the war were experimenting with 
the use of predictions of social events as diag- 
nostic of attitudes. The respondent is given a 
situation and some background information, 
and then asked to make a prediction. Predic- 
tions regarding the likelihood of cooperation 
between social groups have been used, for 
example. The approach can probably be pre- 
sented as a test of ability, and would seem to 
offer the possibility of wide variation in con- 
tent. McGregor (53) has also pointed out the 
relationship between predictions and personal 
attitudes, Using predictions of another kind, 
Davis (17) has devised a simple test for use 
with children which could easily be given in 
an objective frame of reference, and which al- 
ready has the desirable quality of asking the 
respondent to talk about somebody else rather 
than himself. (Indeed, his technique is practi- 
cally identical with the first “projective” per- 
sonality tests designed by Cattell (10) on the 
model of multiple-choice intelligence test items.) 
In Davis’ test, situations are presented, and the 
respondent predicts the protagonist's action by 
selecting from several prepared alternatives. 

Noland (63) used a projective approach 


toward estimating social norms with one set of 
items in his extensive study of absenteeism, 
asking “how important a cause of absenteeism 
do you think each of the following.” These 
single items predicted absenteeism with a cor- 
relation of .50 or better. The value of the 
finding is weakened, however, by having as a 
criterion of absenteeism voluntary reports on 
the same (unsigned) questionnaire. 


Tests of Ability to Do Critical Thinking 


During the last three or four years of his 
life, John J. B. Morgan had been working on 
another technique that admirably meets our 
criteria (55, 56, 57). He used a syllogisms test, 
with a large number of possible conclusions 
provided, from which the respondent was to 
pick the logically correct one. Following Sells’ 
study of atmosphere effect, the syllogisms were 
invalid. In spite of the invalidity of the syllo- 
gism, college students will concentrate their 
choices on the one or two wrong alternatives 
favored by the atmosphere effect. To use this 
phenomenon in diagnosing attitudes, Morgan 
gave the same syllogisms in both impersonal 
or abstract form (e.g, "No A's are B's. Some 
C's are B's. From these statements it is logical 
to conclude: No C's are A's, etc") and with 
content ("A trustworthy man does not engage 
in deceitful acts. The bombing of Pearl Harbor 
by the Japanese was a deceitful act. From these 
statements it is logical to conclude: No Japa- 
nese are trustworthy, . . . etc."). By studying 
the shift in the popular responses from the 
nonsense to the meaningful form, Morgan at- 
tempted to diagnose group attitudes. While the 
technique needs much further study, it seems 
well worth such elaboration. In particular, an 
attempt should be made to control the effect 
of greater tangibility, per se, and also the intru- 
sion of evidence from outside the syllogism 
which might be confounded with purely atti- 
tudinal effects. Note that in this particular 
study, the shifts of the group showed more 
tolerance for the Japanese than otherwise. 
Lansdell has utilized the technique for study- 
ing covert attitudes toward marriage, attempt- 
ing to apply it for the diagnosis of the attitudes 
of a single individual (45). 

Ruch, in his 1937 edition of Psychology and 
Life (69, pp. 633-634), reports briefly upon a 
classroom experiment in which preferences for 
ethnic groups were reflected in the ability to 
judge the validity of syllogisms about the same 
groups. In this instance, no choice of conclu- 
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sions was offered, but rather the students 
judged the validity of the complete syllogism 
as stated. Lefford (46) has shown in more detail 
the disrupting influence of emotional subject 
matter upon such judgments. 

The syllogism approach, and a number of 
others, have been anticipated by Goodwin 
Watson's Measurement of Fair Mindedness, 
published in 1925 (81) By fair-mindedness 
Watson seems to have meant something roughly 
equivalent to tolerance, critical thinking, or 
open-mindedness. His six sub-tests were de- 
signed to provide scores on this trait, which 
they did with adequate reliability. But Watson 
recognized that in the "errors" or lapses from 
fair-mindedness were clues as to the biases or 
attitudes of the respondents, and a secondary 
"analytic" scoring was made in these terms. 
Three of the sub-tests we would characterize as 
being direct? and of the others, we will classify 
two under the heading of "critical thinking," 
treating the third—Moral Judgments—below. 
The Inference Test provided statements of fact, 
followed by several conclusions that might be 
drawn, only one of which is logically justified, 
the others offering opportunities for the intru- 
sion of personal biases in various directions. 
This test has been expanded in the Watson- 
Glazer Tests of Critical Thinking (28) which 
could be scored for various prejudices although 
they have not been so used as far as the present 
writer is aware. Gilbert (27) has made use of 
this same technique in a study in which high- 
school children were given hypothetical racial 
problems for which they could choose strictly 
logical conclusions or conclusions showing bias. 

In Watson's Arguments Test both pro and 
con arguments on various topics are presented, 
and the respondent evaluates these for their 
strength as arguments. The test taps the ten- 
dency for a person to view as strong those argu- 
ments which support his own point of view. 
Robbins (66) has recently confirmed the prin- 
ciples underlying this test, although his data 


2 These are the Word Cross-out Test in which the 
respondent crosses out “distasteful” words, as in 
Pressey's test, the Degree of Truth Test allowing 
for extreme or moderate statements of belief and 
disbelief and, the Generalization Test in which a 
number of statements about various groups are 
presented, and the respondent is to indicate 
whether the statement is true of “All, Most, Many, 
Few, or No” members of the group. (This is prob- 
ably one of the first tests of “stereotyping.” It might 
be classified as indirect if administered as an infor- 
mation test.) 


are not in such form as to indicate whether or 
not accurate inferences as to attitudes could 
be made from such judgments. He found that 
students performed better in judging the ade- 
quacy of reasons for points of view with which 
they agreed than those with which they dis- 
agreed. 


Tests Employing Bias in 
Perception and Memory 

The demonstrations of systematic attitudinal 
bias in highly “objective” perceptual and 
learning tasks have provided some of the most 
important developments of the past 15 years 
for attitude theory (2, 20, 48, 73, 74). The use 
of errors in seeing and remembering in the 
diagnosis of individual attitudes epitomizes 
the disguised, structured, and non-voluntary 
assessment of attitudes. 

Pioneers in the use of this approach are 
E. L. and R. E. Horowitz (37). In their study 
of the development of social attitudes among 
the children of a southern community they 
invented a number of techniques which have 
merited a great deal more re-use :han they 
have seen. While designed to portray group 
differences, many of them should be appro- 
priate to individual testing. Their Aussage 
Test involved exposing a complicated picture 
for two or three seconds, following which they 
tested for perception and memory through a 
series of standardized deliberately leading 
questions. For instance, the question "who is 
cleaning the grounds" to Picture 10 brings 
answers referring to a non-existent Negro in 
some 70% of the cases. To Picture 4, the mis- 
leading question “What is the colored man in 
the corner doing" brings a steadily increasing 
proportion (in reports from higher age groups) 
of menial activities for the non-existent Negro 
man. 

"Their Perception-Span Test involved a se- 
ries of posters each having pictures of 10 items 
mounted upon it. These were exposed for 10 
seconds, and the children asked to "tell all the 
pictures you can remember." While there were 
difficulties with perseveration and blocking, 
some age group differences were noted, with 
the younger children failing to note Negroes, 
and the older ones showing a selective aware- 
ness for them. A Recall Test, asking for the 
reproduction of ten words used earlier in à 
word association test, seemed also to indicate 
that in the younger years the word "Negro" 
was less well remembered than would be ex- 
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pected. The Pictorial Recognition Test involv- 
ing the recognition of faces previously exposed, 
finds the faces of whites more frequently recog- 
nized than the faces of Negroes. 

Seeleman's study (71) confirms this finding 
with respect to individual differences within 
the group. Presenting sets of Negro and white 
photographs she tested for later recognition by 
asking that the previously exposed pictures be 
picked from a larger group. The discrepancy 
between memory for white and Negro faces 
was checked against attitudes on a direct test. 
Using people in the extreme quartiles on the 
direct test only, the test and memory-bias scores 
correlated .64 and .71 with different popula- 
tions. In a paired-associates memory test in- 
volving complimentary phrases to be paired 
with white and Negro pictures, error discrep- 
ancy scores showed no correlation with the di- 
rect test, but the tendency to assign favorable 
phrases to the Negro (regardless of correctness) 
correlated .66 with the direct test in the one 
population for which the correlation is re- 
ported. These values are high enough to sug- 
gest that with refinement an indirect test 
might be achieved. 

On a preliminary experiment, Murray Jarvik 
(22) attempted to utilize memory distortion as 
an indicator of attitudes. Sixth, seventh and 
eighth grade students in a rural school were 
read five minute stories and then were asked to 
write down all they could remember of the 
story. The stories had simple dramatic plots, 
but were full of confusing detail and lacunae 
with regard to specific names, characteristics, 
and ethnic identities. Opportunity was given 
for memory distortion in the direction of com- 
mon stereotypes. The results were essentially 
negative. However, motivation and literacy 
were low, and the task was a part of a rather 
long testing program. Note also that memory 
distortion usually occurs over longer spans of 
time, and that had the test employed selective 
retention rather than distortion it might have 
been more successful. 

Cattell (12, 13) utilized three techniques in- 
volving memory and perceptual distortion. 
His test of "Immediate Memory" was based 
upon selective recall for attitude relevant state- 
ments from sets of twelve presented at one- 
second intervals. In the total, there were over 
500 statements. Spearman-Brown reliability fig- 
ures for eleven attitudes ranged from .13 to 
86, averaging .50. Correlations with other 
measures were essentially zero (with time ex- 


penditure, average .01; with money expendi- 
ture, average .03; with paired-comparisons 
preferences, average .13; with the information 
test (see above), .01), In the scoring, both “facil- 
itating” and “frustrating” statements relative 
to the attitude were pooled. Cattell recommends 
that in future research these be separated. 

Cattell's “Distraction” method involved the 
exposure for ten seconds of statements related 
to the attitudes. Around the statement were 
scattered twelve or thirteen nonsense syllables. 
Subjects were held responsible for recalling the 
statement and the nonsense syllables. Scoring 
was in terms of decrement in nonsense syllable 
recall for statements related to a given attitude. 
The original hypothesis was that the stronger 
the attitude, the poorer the nonsense syllable 
learning, through distraction. Actually, the op- 
posite effect was found. With a Spearman- 
Brown reliability of .64 for one attitude (being 
smartly dressed), the correlation with the 
paired-comparison preference measure was 
—.29, with time expenditure, —10, with money 
expenditure, —.08, and with the information 
test (discussed above) —.35. Since only ten 
items were involved, these values are regarded 
as sufficiently high to encourage further work 
with the method. The “Misperception” method 
involved the tachistoscopic exposure of attitude 
statements for one second, with the respondent 
responsible for repeating the statement and 
noting the misspellings. The overlooking of 
misspelling was regarded as a sign of strong 
attitude. Ten items dealing with the desire to 
play more sociable games provided a Spearman- 
Brown reliability of .43, and correlations with 
the paired-comparisons preference measure of 
.00, the expenditure of money of —.01, and 
the information test of —.13. 

These methods of selective memory and selec- 
tive distractibility have been anticipated in the 
indirect measurement of occupational interests 
by Burtt in 1923 (7, 8, pp. 334-343). In a 
paired-associates memory test, one half of the 
pairs were relevant to agricultural engineering. 
The ratio of memory for these pairs as opposed 
to memory for other pairs correlated .30 with 
instructors ratings as to interest and industry. 
In a cancellation test, in which irrelevant 
words were to be crossed out, different para- 
graph contents were employed. Differentials in 
efficiency of cancellation correlated .30 with 
the above-mentioned instructors ratings. Con- 
trary to Cattell's findings, the greater distracti- 
bility was associated with stronger interest. 
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Tests Involving Ability to Judge Character 


Murphy and Likert, in Public Opinion and 
the Individual (60), utilized a wealth of tech- 
niques anticipating the projective testing move- 
ment. Among these was the “Ratings from 
Photographs.” Following the general frame- 
work of Rice's classic study on stereotypes, they 
provided labeled pictures of a union president, 
a railroad magnate, a pacifist, a Negro cham- 
pion of Negro rights, etc. Respondents were 
asked to judge from these labeled photographs 
the character of the pictured person, in terms 
of courage, selfishness, intelligence, conceit, 
sympathy, practicality, and sentimentality. 
Contrary to expectation, they found no rela- 
tionship between attitudes as measured in a 
variety of paper and pencil tests and these 
picture ratings. However, since the making of 
character judgments from photographs comes 
as near to being a paradigm of psychiatric 
projection as can be found in present test ap- 
proaches, it might be suggested that this tech- 
nique be further explored before being aban- 
doned. In line with the tenor of the present 
discussion would be the suggestion that the 
objectivity or "pseudo-objectivity" of the task 
be increased, with the use of some such label 
as “social intelligence test” or the like. 

In precisely this model is the “Faces Game” 
originally developed by Marion Radke (67, 
p. 50-51) and being used as modified by Isidor 
Chein and Iljana Schreiber in research of the 
Commission on Community Interrelations of 
the American Jewish Congress (16). In this test 
children are asked "Let's see how good you 
are at telling what people are like just by look- 
ing at their faces." Thirty-four sets of four 
pictures (two white and two Negro for all but 
four sets) are presented, each with a question 
such as “one of the girls in this row is very 
lazy and never bothers to do anything” or “one 
of the boys in this row is the best sport in his 
class.” Two scores are provided: one the “white 
salience” score (the total mentions of white 
children for items either good or bad), and 
the prejudice score (the ratio of unfavorable 
choices of faces of the other group to total 
choices of other groups), The test produces 
significant individual and group differences, 
and showed retest reliabilities over a six month 
period of .50 and .86 for white and Negro 
children, respectively, on the prejudice score. 
For the white salience score, the values were 
:52 and .16. The researchers do not regard the 
test as satisfactory for general use as yet. 


Hsü (38) had three women graduate students 
sort male photographs for handsomeness, and 
ten days later for judged membership in the 
Communist party. The correlations were nega- 
tive (—.50 and —.21) for the two women who 
were anti-communist and positive (.51) for the 
one woman who was relatively pro-communist. 
After reading a Time report on the blockade 
of Berlin, the correlations for sorts on a second 
set of the photographs were negative for all 
three women. While this study does not pro- 
vide in itself an adequate basis for measuring 
attitudes, it does hold out promise for more 
extensive work with character judgments from 
photographs. 


Tests Involving Miscellaneous Abilities 


Another of the excellent test approaches in 
the Horowitzs' study (37) was labeled the Cate- 
gories Test. This test was modeled directly 
upon a typical intelligence test item. Five 
pictures were presented, and the question asked 
"Which one does not belong?" As they adapted 
it, the item could be answered in more than 
one way, and the choice indicate something of 
the important social categories for the child. 
For example, a page might contain pictures 
of three white boys, one white girl, and one 
Negro boy. In this instance, the child could 
use either sex or race as the dominant category. 
Other items provided opportunities to cate- 
gorize by age and socio-economic status as well. 
Here again, the emphasis was on group norms 
rather than individual differences. 

Hartley (32) has more recently used the cate- 
gorization of photographs as a measure of eth- 
nic salience. In this study, however, the task 
was presented in a voluntary framework, in 
contrast with the objective assignment presented 
in the Categories Test. In Hartley's "faces test" 
the respondent is told to: "Sort them out into 
piles. You can make as many piles as you want 
to, or as few as you choose. You can classify 
them on any basis you want to!" 

In the Murray and Morgan "Study of Sen- 
timents" (59) several of the "indirect methods, 
methods which conceal the examiner's intent" 
provide an ability test facade. The "Sentiments 
Examination" Section A Part 1 asks for the 
"most descriptive adjectives" for 48 stimulus 
words, "the S being led to believe that the 
examiner is interested in testing the range of 
his vocabulary." In Part 3 “The S is led to be- 
lieve his verbal ability is being tested with a 
simile completion test,” eg. "As pathetic as 
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a...."In the “Arguments Completion Test” 
the respondent is asked to continue and finish 
an argument, the beginning of which has been 
described. “Being led to believe his powers of 
argumentation are being tested, the subject 
quickly becomes involved in the controversy 
he or she is inventing, and ends by exposing 
more of his own sentiments than he might 
otherwise have done” (59, pp. 58-60). 


Tests Involving Miscellaneous Judgments 


The following tests are disguised and struc- 
tured, but in part at least must be classified 
as voluntary, in so far as judgments are re- 
quired in situations in which there is no “ob- 
jective” right answer. On the other hand, they 
retain the advantage of having the respondent 
(1) work on a task presumably less threat- 
ening than the experimenter's primary prob- 
lem, and (2) report upon external values or 
realities, rather than upon himself directly. 
These studies like many already discussed, de- 
pend upon the demonstration of a judgment 
differential of which the respondent is pre- 
sumably unaware. 

In Watson's (81) "Moral Judgments" sub- 
test, judgments of approval or disapproval are 
made about a variety of situations, sets of these 
situations being identical except for the spe- 
cific persons or groups involved. For example 
unwarranted search is made of a suspected 
"radical" headquarters, on the one hand, while 
in a parallel item, the same type of search is 
carried out with a business corporation sus- 
pected of dishonesty. Scoring is done on the 
basis of discrepancy of judgment between the 
parallel situations. 

Similar in title and in plan to Watson's 
Moral Judgments Test is the ingenious ap- 
proach to Negro-White attitudes devised by 
Seeman (72). While he used equated compari- 
son groups, the technique could be modified 
for diagnosing bias in individuals. He selected 
six items from a standard test of moral evalu- 
ations on marriage and sexual matters. To one 
group, the episode items were illustrated by 
pictures of white couples; to the other, with 
Negro illustrations. Both groups were made up 
of white college students. Contrary to expec 
tation, the judgments were more lenient—less 
disapproval—with the Negro illustrations. Fur- 
thermore, when the two groups were sub- 
classified according to scores on the Likert scale 
of attitudes toward the Negro, the major part 
of the differential was contributed by the more 


tolerant persons rather than the more anti- 
Negro parts of the two groups. The intolerant 
extremes in this sample were more consistent, 
less "biased." These results are important and 
meaningful, but further indicate the danger of 
oversimplified interpretations in indirect ap- 
proaches. 

The readiness with which judgments of lit- 
erary merit can be manipulated by the substi- 
tution of fictitious authors has been demon- 
strated by Farnsworth (70) and Sherif (73). 
Prestige seems to be one of those forces which 
can bias the performance of an objective task. 
Can we use this bias to infer attitudes? Lewis 
(47) seems to have had in mind some such 
approach. The present writer (22) deliberately 
set out to utilize the prestige effect to measure 
attitude toward five minority groups. Rather 
than literary passages, proverbs or mottoes 
were used. Rather than individual authors, the 
adage was attributed to the group as a whole, 
e.g., "American pioneer saying" or "old Jewish 
motto." Eighth and ninth grade students were 
asked to evaluate the quality of each motto 
separately—there being ten mottoes attributed 
to each group. The test yielded a general preju- 
dice score that correlated but .30 with a direct 
test. Other scores were worthless. However, in 
view of the strength of prestige suggestion in 
other experiments, this approach seems worth 
trying further. It is quite possible that in the 
form given the task was trivial and the disguise 
thin. 

Horowitz (35) illustrated the development of 
patriotism and conformity among children by 
tests in which the task was to pick "the best 
looking flag," or the "best place to live in" 
from a series of photographs of houses, etc. 

As Wolff, Smith and Murray (83) have shown, 
reactions to group-disparagement jokes are 
correlated with group membership. Gordon 
(29) has recently tried to utilize this phenom- 
enon in the assessment of social attitudes. 
Twenty-four jokes, both antagonistic and sym- 
pathetic, dealing with Negroes and Jews were 
rated as to their funniness on a 5-point scale. 
Four groups of college men were used: a Prot- 
estant fraternity, a Catholic fraternity, a Negro 
fraternity, a Jewish (non-Zionist) fraternity, 
and a Zionist club. The groups differed in their 
responses to these jokes but not as anticipated 
in all instances. The jokes sympathetic to Jews 
were rated highest by the two Jewish groups— 
but the anti-Jewish jokes failed to differentiate 
the groups. The anti-Negro jokes were rated 
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lowest by the Negro group, but the pro-Negro 
jokes failed to differentiate. The various groups 
ranked the individual jokes quite similarly as 
to popularity, with rho’s between the orders 
for the different groups ranging from .71 to .94. 
With regard to individual differences within 
the various groups, the ratings of the jokes 
showed no relationship to attitudes toward 
Negroes or Jews as revealed in a direct attitudes 
test or a symbol endorsement test. The tests 
were administered as a part of a larger testing 
program. 

Cattell (13), in the program already referred 
to used a “Projection-Phantasy” test, in which 
the respondent picked from ten sets of ten 
incomplete sentences, five each which he com- 
pleted. Rather than scoring the words used in 
completing the sentences, Cattell scored the 
choice of incomplete sentences. Each set con- 
tained one representative of each of ten dif- 
ferent attitudes. Applied to the desire to excel 
in one's profession, and to the desire for more 
sleep and rest, this test had Spearman-Brown 
reliabilities of .13 and .60. Correlations with 
the paired-comparison preferences were .15 and 
31, with time and money expenditure, .01 
and .26, with information, .20 and .20. In spite 
of the relatively low level of these values, this 
method was regarded as one of the more 
promising. 

Kalpakian's “The Construction of a Dis- 
guised Test by the Use of Photographs for the 
Study of Attitudes Toward Negroes” (40) repre- 
sents an attempt at disguise which fails in part 
to meet the criteria suggested here. Like other 
tests reported in this section, a judgment dif- 
ferential of which the respondent was unaware 
was sought. Twenty-six matched pairs of pic- 
tures was selected—one set depicting Negroes, 
the other whites. The test was structured, in 
that a choice of responses, “Like—Indifferent 
—Dislike" was required for each picture. The 
instructions read in part "we're interested in 
seeing what kinds of pictures people like and 
don't like . . . we're not interested in the tech- 
nical photographic aspects of the pictures, 
rather we want your reactions to what is in the 
pictures. That is, how do you feel about what 
is in each picture.” Note the voluntary nature 
of this assignment—the respondent being told 
in effect that there are no right or wrong 
answers. Note also the absence of any clear cut 
assigned basis of judgment which might lend 
indirection (such as asking for judgments of 
beauty of the photograph or skill of photogra- 


pher). Kalpakian did aid the disguise by plac- 
ing these 52 photographs in random order 
among 68 irrelevant pictures. This is similar 
to the efforts in disguise made by those who 
distribute the items of one attitude scale among 
a variety of other types (e.g., 24). In one of 
Kalpakian’s groups, 100 subjects were asked 
what they thought the purpose of the test was. 
Only 15 did not guess that the purpose in- 
volved attitudes toward the Negro; 51 thought 
this and several other attitudes were being 
tested; 34 judged correctly that the primary 
purpose was attitudes toward Negroes. On the 
other hand only 7 were aware of the matching 
of the pictures. Scored on proportion of white 
to Negro "likes" the test showed corrected split- 
half reliability coefficients between .71 and .82 
in various white high school and college groups. 
Mean values showed a definite “white salience” 
for all groups. Scores correlated with both self 
ratings and ratings by friends, demonstrated by 
critical-ratio tests. The correlation with the 
Thurstone-Hinkley was .40, which became .80 
when corrected for restricted range. In terms 
of the categories of this review, Kalpakian’s 
test is similar to the “show me” and other pic- 
ture tests developed by Horowitz (34). It is 
structured in that a limited number of response 
categories is provided. It is direct, substituting 
pictures for the ethnic labels used in statement- 
endorsement type of direct tests. 


GENERAL CONSIDERATIONS 


Validity of Indirect Tests 


Most efforts to develop indirect attitude tests 
are predicated upon the assumption that in- 
direct tests will under certain conditions have 
higher validity than direct tests. It is worthy 
of note that none of the studies reviewed here 
offers any evidence that this is the case. The 
testing of this hypothesis is certainly the next 
step if these preliminary efforts are thought 
to be sufficiently promising to justify more 
research. 

The most usual effort at validation has in- 
volved correlating the indirect test with direct 
test results, under conditions in which the 
latter were gathered anonymously or in which 
there was no reason to suspect respondent dis- 
sembling (e.g, 19, 22, 29, 38, 40, 43, 60, 64, 
65). Hammond (31) and Gordon (29) used en- 
vironmentally selected criterion or comparison 
groups. Parrish's tests (64) meaningfully re- 
flected an experience differential between two 


THE INDIRECT ASSESSMENT OF SOCIAL ATTITUDES 175 


groups. These are, however, isolated instances, 
and even at the exploratory level, we might 
expect more use of criterion groups. 

As has been pointed out previously, the case 
for face validity for indirect tests may often 
be better than for direct tests. In a number 
of the disguised, structured tests, the distri- 
bution of scores and measures of internal con- 
sistency demonstrates unequivocally that non- 
random, systematic errors, differences in per- 
ceptions, etc. exist, of which the respondents 
are presumably unaware. These systematic un- 
conscious "biases" are well worth study in their 
own right, and seem to lie close to the func- 
tional meaning of attitude. 


Attitude Measurement as Related to 
Attitude Definition and Theory 


Research on social attitudes has been justly 
criticized for a lack of common definition of 
the concept, and for a failure to integrate defi- 
nition and measurement procedures. This 
diversity of definition has been in odd con- 
trast with the obvious similarity of research 
procedures. This paradox arises from defini- 
tional attempts which confound explanations 
of the phenomena with the process of pointing 
to the phenomena. It is the contention of the 
present writer that agreement on the implicit 
operational (or pointing) definition of attitudes 
is already present. As a tentative formulation 
the following is offered: A social attitude is (or 
is evidenced by) consistency in response to 
social objects. If we look at those definitions 
Utilizing concepts of set, or readiness to respond 
—for example, Allport's (1): "An attitude is 
a mental and neural state of readiness, orga- 
nized through experience and exerting a direc- 
tive or dynamic influence upon the individual's 
response to all objects and situations to which 
it is related"—and ask for the evidence of a 
"mental and neural state of readiness" the 
Symptoms of a "directive or dynamic influ- 
ence," criteria as to the "objects and situations 
to which it is related," these evidences will be 
in final analysis consistency or predictability 
among responses. An individual's social attitude 
is a syndrome of response consistency with 
regard to social objects. And even those whose 
behavioristic orientation leads to a rejection 
of such mentalistic definitions as Allport's— 
and who would say with Bain (3) and Horo- 
Witz (36, p. 142), "essentially . . . the attitude 
must be considered a response rather than a 
Set to respond"—in research practice do not 


equate isolated responses with attitudes, but on 
the contrary look for the appearance of re- 
sponse consistencies. This is dramatically evi- 
denced by Horowitz's (34) use of the appear- 
ance of consistent differentiated response to 
photographs of Negro and white children to 
mark the occurrence of race prejudice in 
children. 

This standpoint provides a basis for the oper- 
ational delimitation of specific and general 
attitudes. It likewise demands that if attitude 
measurement be integrated with definition, inter- 
nal consistency among the sample responses 
collected by the test must be demonstrated. 
To the present writer, the demonstra- 
tion of a single factor through factor analysis 
seems ideal, although time consuming. Spear- 
man-Brown and Kuder-Richardson (44) relia- 
bility measures, Guttman's (30) "reproducibility 
coefficient,” Loevinger's (52) “homogeneity,” 
and Thurstone’s “coefficient of irrelevance” 
(78) are likewise evidences of internal consis- 
tency. Item analysis against total scores (49, 
75), remains one of the best practical means 
of improving internal consistency. 

Contemporary American theories commonly 
hypothesize attitudes to be learned, or identify 
them with other cognitive phenomena such as 
perception and concept formation. From the 
operational definition suggested it would be 
hard to distinguish attitude from habit, which 
likewise is evidenced by behavior consistency. 
Indeed, if learning be conceived broadly 
enough to include the development of any 
response consistency, attitude formation is iden- 
tical with learning. Traditionally, learning has 
been used in those situations in which the 
response consistencies represented optimal re- 
flections of a simple, uniform environment— 
where increasing response consistency repre- 
sented the elimination of “error.” But a similar 
process goes on in “unsolvable” or highly com- 
plicated environments, and stereotyped modes 
of response are developed which are non- 
optimal reflections of the environment, which 
do not represent error-reduction. The terms 
perseveration, set, fixation, position habit, bias, 
irrational behavior, idiosyncracy, pre-judgment, 
non-random error, over simplification, bad 
habit, rigidity, all come to mind to describe 
this process. Presumably, this latter type of 
response consistency is a result of the same 
cognitive processes which under other condi- 
tions result in “right answer” learning. It is 
with these common processes that the concept 
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of attitude is identified, although typically it 
is most closely associated with the acquisition 
of “irrational” or non-adaptive response con- 
sistencies. 

In these terms it may be meaningful to 
compare direct and indirect efforts at attitude 
assessment. In both direct and indirect tests 
enough internal consistency has been demon- 
strated on some topics to justify speaking of 
attitudes. Likewise, in numerous instances re- 
ported here, considerable correlation has been 
demonstrated between responses to direct and 
to indirect tests, justifying the hypothesis that 
a common attitude lies behind them. Missing 
is the demonstration that this pattern of con- 
sistency extends beyond paper and pencil to 
the “real-life” situations which are usually in 
mind when the concept of "attitude" is used. 
While meaningful research problems exist in 
regard to patterns of consistency within a uni- 
verse of paper and pencil responses, the bulk 
of the researches reviewed are predicated upon 
the assumption that broader consistencies exist 
—that overt behavioral manifestations of the 
attitude can be predicted. 

With regard to the level of response con- 
sistency, it should be noted that direct tests 
uniformly have much higher reliability coef- 
ficients than do indirect ones, especially when 
the number of items and time of administration 
are considered. Of course this consistency is in 
part conscious, voluntary, and possibly super- 
ficial—in contrast to the involuntary "bias" in 
performance achieved by many indirect tests. 
In this sense, the indirect tests of the non- 
voluntary sort do in themselves demonstrate 
and utilize the dramatic phenomena of atti- 
tudinal interference with tasks of learning, 
perceiving, remembering, and evaluating, such 
as are summarized by Sherif and Cantril (74, 
pp. 29-91). And it is such consistencies, rather 
than consistency in voluntary self-description, 
which give vitality to current usage of the 
concept of attitude. In this, indirect attitude 
tests lie closer to contemporary attitude theory 
than do direct tests. 


SUMMARY 


The efforts at indirectly measuring social 
attitudes have been summarized. A distinction 
has been offered between disguised, non-struc- 
tured, types of test on the one hand, and dis- 
guised, structured, tests on the other. Because 
of a greater freedom from dependence upon 


the respondent's voluntary self description, the 
use of disguised, structured tests has been 
emphasized. 

As can be seen from the survey, there is 
an abundance of partially tried techniques. 
Equally good ones can be developed from the 
general formula: A plausible task, (a) which 
your respondents will all strive to do well, 
(b) which is sufficiently difficult or ambiguous 
to allow individual differences in response, and 
(c) which can be loaded with content relative 
to the attitude you seek to measure. Test the 
responses of individuals for persistent selectivity. 
in performance, for correlated or non-random 
errors. For example, several persons have sug- 
gested to the present writer what might be 
called the “photocrime” test. Here criminal 
scenes would be pictured in photograph and 
text. Sufficient ambiguity would be provided 
to make it a challenging problem to the ama- 
teur detective, and, incidentally, to allow the 
possibility of bias in the selection of the crim- 
inal from the suspects of various ethnic or 
social identification. A test of "common legal 
knowledge" might well provide a framework 
for carrying further some of the notions in 
Watson's "Moral Judgments” sub-test. The 
respondent might be asked to decide whether 
or not civil liberties had been violated in a 
wide range of specific situations—might be 
asked to decide between claimant and defen- 
dant in a number of civil suits. Here the con- 
tent could easily be manipulated to get at 
almost any social attitude one might be inter- 
ested in. It should not be too difficult to keep 
the respondent convinced that he dealt solely 
with matters of the law. We are prone to see 
as propaganda that with which we disagree. 
A "propaganda analysis test" should reveal in- 
directly some people's attitudes. For the build- 
up, give a review of some principles on how 
to spot propaganda, how to detect an ulterior 
motive, a selfish interest group, deceit, exaggera- 
tion, etc. Then have the respondent test his 
skill on a number of argumentative paragraphs, 
which he evaluates as to the extent and kind of 
propaganda present. The root of this notion 
may be found in Watson's "Arguments" sub-test. 

Such devices may be multiplied, the reader 
can no doubt provide better ones. In some of 
these, the fiction of an objective test situation 
will be hollow indeed. In others, for example 
in the estimation of group opinion, we are tap- 
ping biased performance in an everyday act 
of social perception. We are not taking at face 
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value the respondent's own volunteered descrip- 
tion of himself. Rather we are studying sys- 
tematic errors in the respondent’s own percep- 
tions, errors of which he himself is not aware. 
Not only in attitude testing but in personality 
study in general, distortions of performance in 
dealing with the environment provide objec- 
tive evidence of an individual’s unique picture 
of his world. Just as the psychophysicists were 


BIBLIOGRAPHY 


long ago forced to introduce Vexirwersuche, or 
catch trials, into introspection—to eliminate 
impossible negative reaction times and infin- 
itesimal two-point thresholds—so personality 
testing, whether check-list or free-response, 
must rise above dependence on voluntary self- 
description. The development of structured 
indirect attitude tests is thought to be a step 
in this direction. 


l1. Allport, G. W. Attitudes. Ch. 17 in C. Murchison (Ed), 4 handbook of social psychology. 


Worcester, Mass.: Clark University Press, 1935. 


2. Allport, G. W., and Postman, L. The psychology of rumor. New York: Holt, 1947. 

3. Bain, R. An attitude on attitude research. Amer. J. Sociol., 1928, 33, 940-957. 

4. Bartlett, F. C. The study of society. London: Kegan Paul, 1939. 

5. Bartlett, F. C, The cooperation of social groups: a preliminary report and suggestions. 


Occup. Psychol., 1938, 12, 30-42. 


6. Brown, J. F. Modification of the Rosenzweig picture frustration test to study hostile inter- 


racial attitudes. J. Psychol., 1947, 24, 247-272. 


7. Burtt, H. E. Measuring interests objectively. Sch. and Soc., 1923, 17, 444-448. 
8. Burtt, H. E. Principles of employment psychology (Rev. Ed.). New York: Harper, 1942. 
9. Carmichael, D. M. The cooperation of social groups. Brit. J. Psychol., 1938, 29, 206-231; 329- 


344. 


10. Cattell, R. B. A guide to mental testing. London: Univ. London Press, 1936. 


1l. Cattell, R. B. Projection and the design of projective tests of personalit 


Pers., 1944, 12, 177-194. 


y. Charact. and 


12. Cattell, R. B., Heist, A. B., Heist, P. A., and Stewart, R. G. The objective measurement of 
dynamic traits. (To be published in Educ. psychol. Msmt., 1950.) 

18. Cattell, R. B., Maxwell, E. F., Light, B. H., & Unger, M. P. The objective measurement of 
attitudes, (To be published in the Brit. J. Psychol., 50, 1950.) 

14. Coffin, T. E. Some conditions of suggestion and suggestibility: a study of certain attitudinal 
and situational factors influencing the process of suggestion. Psychol. Monogr. 1941, 4, 


No. 241. 


15. College Study in Intergroup Relations. (Lloyd Allen Cook, Director. Study forms and tech- 


nics in intergroup relations. Supplementary Sheet No. 5, Jan. 


troit: Wayne University. 


16. Commission on Community Interrelations of the 
Research Director.) The face game. Unpublished 
originally developed by Marian Radke. 1947-1948. 


1948. (Mimeographed.) De- 


American Jewish Congress. (Stuart Cook, 
reasearch on a modified version of a test 


17. Davis, T. E. Some racial attitudes of Negro college and grade school students. J. Negro 


Educ., 1987, 6, 157-165. 


18. Deri, S., Dinnerstein, D., Harding, J., and Pepitone, 
measurement of intergroup attitudes and behavior. 
19. Dubin, S. S, Verbal attitude scores predicted from responses in a 


ciometry, 1940, 3, 24-28. 


20. Edwards, A. L, Political frames of references as 


soc. Psychol., 1948, 36, 34-61. 


A. D. Techniques for the diagnosis and 
Psychol. Bull., 1948, 45, 248-271. 
projective technique. So- 


a factor influencing recognition. J. abnorm. 


21. Evans, M. C., and Chein, I. The movie story game: a projective test of interracial attitudes 


for use with Negro and white children. Paper read at the 56 


th annual meeting of the Amer- 


ican Psychological Association, Boston, Sept. 8, 1948. 


22. Frenkel-Brunswik, E., Jones, H. E. (Di 


rectors), and Rokeach, M., Jarvik, M., and Campbell, 


D. T. (Staff), Unpublished research on the personality correlates of anti-minority attitudes 


among grade school children. A project 


of the University of California Institute of Child 


Welfare, financed by a grant from the American Jewish Committee, 1946-1947. 


23. Frenkel.Brunswik, E., and Reichert, S. Persona 


manuscript.) 


lity and prejudice in women. (Unpublished 


178 
24. 


25. 
26. 
27. 
28. 
29. 


30. 
31. 


38. 


39. 


52. 


Sz 


ATTITUDE MEASUREMENT 


Frenkel-Brunswik, E., Levinson, D., and Sanford, R. N. The anti-democratic personality. In 
T. M. Newcomb and E. L. Hartley (Eds.), Readings in social psychology. New York: Holt, 
1947. 

Fromme, A. On use of qualitative methods of attitude research. J. soc. Psychol., 1941, 13, 
429-460. 

Fryer, D. The measurement of interests. New York: Holt, 1931. 

Gilbert, H. H. Secondary science and pupil prejudice. J. educ. res. 1941, 35, 294-299, 

Glazer, E. M. An experiment in the development of critical thinking. Teach. Coll. Contr, 
Educ., No, 843. New York: Bureau of Publications, Teachers Coll., Columbia Univ., 1941. 
Gordon, S. Exploration of social attitudes through humor. Master's Thesis, Univ. of Illinois, 
1947. 

Guttman, L. A basis for scaling qualitative data. Amer. sociol. Rev., 1944, 9, 139-150. 
Hammond, K. R. Measuring attitudes by error-choice; an indirect method. J. abnorm. soc. 
Psychol., 1948, 43, 38-48. 


. Hartley, E. L. Problems in prejudice. New York: Kings Crown Press, 1946. 


Hartley, E. L., and Schwartz, S. A pictorial-doll play approach for the study of children's 
intergroup attitudes. Mimeographed preliminary draft. Research Institute in American Jew- 
ish Education, American Jewish Committee, Summer, 1948. 


. Horowitz, E. L. The development of attitude toward the Negro. Arch. Psychol., N. Y., 1936, 


28, No. 194. 


. Horowitz, E. L. Some aspects of the development of patriotism in children. Sociometry, 


1940, 3, 329-341. 


. Horowitz, E. L. "Race" attitudes. In O. Klineberg (Ed), Characteristics of the American 


Negro. New York: Harper, 1944. 


- Horowitz, E. L., and Horowitz, Ruth E. Development of social attitudes in children. Soci- 


ometry, 1938, 1, 301-338. 

Hsü, E. H. An experimental study of rationalization. J. abnorm. soc. Psychol, 1949, 44 
277-278. 

Johnson, G. G. An experimental analysis of the origin and development of racial attitudes 
with special emphasis on the role of bilingualism. Ph.D. Thesis, Univ. of Colorado, 1949. 
Kalpakian, E. Y. The construction of a disguised test by use of photographs for the study 
of attitudes toward Negroes, Master's Thesis, Clark University, 1947. (Clark University 
Bulletin. Abstracts of Dissertations and Theses, Worcester, Mass., Oct., 1947.) 


- Katz, Martin R. A hypothesis on anti-Negro prejudice. Amer. J. Sociol., 1947, 58, 100-104. 
. Krech, D., and Crutchfield, R. S. Theory and problems of social psychology, New York: 


McGraw-Hill, 1948. 


. Kremen, E. O. An attempt to ameliorate hostility toward the Negro through role playing. 


Master's Thesis, The Ohio State University, 1949, 


» Kuder, G. F., and Richardson, M. W. The theory of the estimation of test reliability, Psy- 


chometrika, 1937, 2, 151-160, 


» Landsdell, H. A study of distorted syllogistic reasoning as a means of discovering covert 


attitudes toward marriage. Bull. Canad. Psychol. Assn., 1946, 6, 98 (Abstract.) 


Lefford, A. The influence of emotional subject matter on logical reasoning. J. gen. Psychol., 
1946, 34, 127-151. 


Lewis, H. B. An approach to attitude measurement. Psychol. League J., 1998, 2, 64-67. 


. Levine, J. M., and Murphy, G. The learning and forgetting of controversial material. J. 


abnorm. soc, Psychol., 1943, 38, 507-517. 


. Likert, R. A technique for the measurement of attitudes. Arch. Psychol., N. Y., No. 140, 


1932. 


Locblowitz-Lennard, H., and Riessman, F., Jr. A proposed projective attitude test. Psychi- 
atry, 1946, 9, 67-68. 


+ Loeblowitz-Lennard, H., and Riessman, F., Jr. A preliminary report on social perception 


test—a new approach to attitude research. Social Forces, 1946, 24, 423-427. 

Loevinger, J. The technic of homogeneous tests compared with some aspects of “scale anal- 
ysis” and factor analysis. Psychol. Bull., 1948, 45, 507-529. 

McGregor, D. The major determinants of the prediction of social events. J. abnorm. soc. 
Psychol., 1938, 88, 179-204. 


- McNemar, Q. Opinion-attitude methodology. Psychol. Bull, 1946, 43, 289-374. 
- Morgan, J. J. B. Distorted reasoning as an index of public opinion. Sch. and Soc., 1943, 57, 


333-335. 


. Sweet, L. The measurement of personal attitudes in youn, 


. Travers, R. M. W. A study in judging the opinions of groups. 


. Watson, G. B. The measurement of fair-mindedness. Teci 
. Williams, Robin M., Jr. The reduction of intergroup t 


. Wolff, H. A., Smith, C. E., and Murray, 


THE INDIRECT ASSESSMENT OF SOCIAL ATTITUDES 


. Morgan, J. J. B., and Morton, J. T. The distortion of syllogistic reasoning produced by 


personal convictions. J. soc. Psychol., 1944, 20, 39-59. 


. Morgan, J. J. B. Attitudes of students toward the Japanese. J. soc. Psychol., 1945, 21, 219-246. 
. Murray, H. A. Explorations in personality. New York: Oxford Univ. Press, 1938. 
. Murray, H. A., and Morgan, C. D. A clinical study of sentiments. I and II. Genet. Psychol. 


Monogr., 1945, 32, 3-311. 


. Murphy, G., and Likert, R. Public opinion and the individual. New York: Harpers, 1937. 
. Newcomb, T. M. Personality and social change, New York: Dryden Press, 1943. 
. Newcomb, T. M. The influence of attitude climate upon some determinants of information. 


J. abnorm. soc. Psychol., 1946, 41, 291-302. 


. Noland, E. W. Factors associated with absenteeism in a south central New York State in- 


dustry. Ph.D. thesis on file in the Cornell University Library, 1944. 


. Parrish, J. A. The direct and indirect assessment of attitudes as influenced by propagandized 


radio transcriptions. Master's thesis, The Ohio State University, 1948. 


. Proshansky, H. A projective method for the study of attitudes. J. abnorm. soc. Psychol., 


1943, 38, 383-395. 


. Robbins, I. Point of view and quality of thought in attitude measurement. Improving Edu- 


cational Research, pp. 52-56. American Educ. Res. Ass. 1948 Official Report, Washington, 
D. C. 


. Rose, Arnold. Studies in reduction of prejudice. Chicago: American Council on Race Rela- 


tions, 1948. 


. Rotter, J. B., and Willerman, B. The incomplete sentences tests as a method of studying 


personality. J. consult. Psychol., 1947, 11, 43-48. 


. Ruch, F. L. Psychology and life. (Ist Ed.). Chicago: Scott, Foresman and Company, 1937. 
. Saadi, M., and Farnsworth, P. R. The degrees of acceptance of dogmatic statements and 


preferences for their supposed makers. J. abnorm. soc. Psychol., 1934, 29, 143-150. 


. Seeleman, V. The influence of attitude upon the remembering of pictorial material. Arch. 


Psychol., N. Y., 1940-1941, 36, No. 258. 


. Seeman, M. Moral judgments: a study in racial frames of references, Amer. sociol. Rev., 


1947, 12, 404-411. 


. Sherif, M. A study of some social factors in perception. Arch. Psychol., N. Y., 1935, No. 187. 
. Sherif, M., and Cantril, H. The psychology of ego-involvements. New York: Wiley, 1947. 
. Sletto, R. F. Construction of personality scales by the criterion of internal consistency. Han- 


over: The Sociological Press, 1937. 


. Smith, G. H. Beliefs in statements labeled fact and rumor. J. abnorm. soc. Psychol., 1947, 


42, 80-90. Á 
ger boys. New York: The Associa- 


tion Press, 1929, 


. Thurstone, L. L., and Chave, E. J. The measurement of attitude. Chicago: Univ. Chicago 


Press, 1929. 
ress Arch. Psychol., N. Y., No. 
266, 1941. 


Wallen, R. Individuals’ estimates of group opinion. J. soc. Psychol., 1943, 17, 269-274. 


h. Coll. Contr. Educ. No. 176. New 


York: T 1l., Columbia Univ., 1925. 
m ÓÓ ensions. New York: Social Science 


Research Council Bull, No. 57, 1947. 
H. A. The psychology of humor; l. a study of race 


i j -865. 
disparagement jokes. J. abnorm. soc. Psychol., 1934, 28, 341 
Zeligs, R. Racial attitudes of children, Social. soc. Res., 1987, 21, 361-971. 


179 


20. The Identification of Prejudicial Attitudes by the 


Galvanic Skin Response 


JOSEPH B. COOPER and DAVID POLLOCK 


A. INTRODUCTION 


This study was designed as an extension of 
two previous studies (2, 3) which provided 
physiological evidence in support of the view 
that prejudicial-attitudes are affectively forti- 


In the first of these studies, statements which 
described certain ethnic and national groups 
for which individual subjects had previously 
and variously expressed antipathy, neutrality, 
and affinity, by rating and ranking scale means, 
were read to those subjects in individual ex- 
perimental sessions. GSRs to derogatory state- 
ments in reference to affinity groups were 
greater than to neutral groups in 14 of 20 
cases; p < .0594. GSRs to complimentary state- 
ments in reference to antipathy groups were 
greater than to neutral groups in 19 of 20 cases; 
p < 00007. 

The second study was a replication of the 
first, with attention solely directed to stimulus 
groups held in antipathy. In this study GSRs 
were greater to complimentary statements in 
reference to antipathy groups than in reference 
to neutral groups in 20 of 23 cases; p < .0005. 

The design of the present study was essen- 
tially the reverse of the first two. In this study 
GSRs were recorded first. From these, predic- 
tions were then made of attitude content, as 
measured by paper and pencil scaling means. 
* 

B. THE PROBLEM 

Whereas previously it had been found that 
individuals who expressed intense dislike for 
particular groups emitted excessively large 
GSRs to complimentary statements in reference 


* Reprinted from Journal of Social Psychology, 
1959, 50, 241-245, with permission of the senior 
author and the publisher, 


to those groups, it had still not been deter- 
mined if relatively large GSRs were predictive 
of prejudicial-attitudes for an unselected 
sample. The question was: Does an individual's 
relatively large affective response to a compli- 
mentary statement in reference to a group 
indicate a negative content prejudicial-attitude 
toward that group? Operationally, affectivity 
level is described by GSR and prejudicial- 
attitude is described by paper and pencil 
scaling. 

In view of the theoretical formulation of the 
problem and results of the two previously men- 
tioned studies, an hypothesis was formulated. 
Its theoretical statement was: Display of exces 
sive affectivity in response to a complementary 
description of a national or ethnic group is 
predictive of an antipathetic prejudicial-atti- 
tude toward that group. Its operational state- 
ment was: A relatively great GSR to a compli- 
mentary verbal description of a national or 
ethnic group is predictive of a relatively low 
attitude scale position for that group. 


C. EQUIPMENT, SUBJECTS, AND 
PROCEDURE 


The GSR unit of a Keeler Polygraph (Model 
502) was used. This unit is equipped with a 
kymograph mechanism which carries coördi- 
nate-line paper upon which a pen records GSR 
magnitude. Rhodium palmar electrodes were 
used. 

Attitudes were scaled by paired-comparison. 
The paired comparison scale contained the 
names of the following groups: Austrian, Cana- 
dian, English, German, Irish, Japanese, Jew, 
Mexican, Pole, Swede. The test-retest Spearman 
r reliability median was .96. 

Subjects were students in two sections of 


general psychology. They were divided approxi- 
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mately evenly as to sex. Seventy-two subjects 
were given GSR tests. Of these, 19 did not re- 
spond sufficiently for inclusion within the 
sample. The sample N was 53. 

Subjects were given appointment slips, each 
identified by number, not by name. They ap- 
peared for individual experimental sessions 
during class periods, each session requiring 
approximately 15 minutes. 

Upon coming into the experimental room, 
the subject was shown the polygraph, and its 
basic characteristics were explained. The ex- 
periment itself was explained as a test of the 
GSR as a function of standard stimuli which 
were being read to all subjects. The subject's 
anonymity was assured. The subject was seated 
in a comfortable, wide-arm chair to the left of 
the polygraph. He could see the polygraphy 
peripherally, but could not follow the pen trac- 
ings. The experimenter stood to the right-rear 
of the subject. After placement of the electrodes 
the subject was "balanced in" by use of stan- 
dard stimuli such as a hand-clap, touch on the 
neck, or a loudly spoken word. Final "balanc- 
ing in" was accomplished while reading a com- 
plimentary statement which referred always to 
Irish. Thus, in the experiment proper, 9 groups 
were used for stimulus material. 

The order of presentation of the stimulus 
statements was standard except for Nos. 3, 5, 
and 7. Since it had already been found (1) 
that Japanese, Jews, and Mexicans were groups 
toward which relatively great antipathy is 
shown by the local college population, they 
were randomized within these positions. As 
well, the complimentary statements descriptive 
of them were randomized. Thus, these three 
groups and the statements which described 
them were randomized in these three positions. 

The group description statements were all 
of equal length, a blank at the second word 
position being left for insertion of the name 
of the appropriate group. As an example, one 
Statement was: “The are people it 
would be profitable to imitate, for they are 
superior both intellectually and morally.” 

The experimenter noted on the kymograph 
paper the beginning of the reading of a state- 
ment. After the pen had come to or about the 
base-line for 10 seconds another mark was 
made. Such a recorded limitation described a 
GSR. Thus, the area under the curve was evalu- 
ated as the summation of the rectangular spaces 
which were described by the parameter. In 
that there seems to be no accurate means for 


comparing GSRs from one subject to another, 
comparisons were made intra-subject, not inter- 
subject. 


D. RESULTS 


Four tests of the hypothesis were made. Each 
was a test of data accumulated by way of pro- 
cedures just described. 


1. The First Test 


With reference to the three stimulus groups 
which were randomly rotated (Japanese, Jews, 
Mexicans), it was predicted that subjects whose 
GSRs to them were above their individual GSR 
means would rank them below the paired com- 
parison median. For each of the three groups 
the X? test for two independent samples (4, 
p. 109) was used. They were: 8.54, 6.73, and 
9.48 respectively. In each case p < .005. 


2. The Second Test 


Each subject’s greatest GSR was identified 
now with reference to all 9 stimulus groups. 
The prediction was made that the group to 
which the greatest GSR was made would be 
ranked by the subject below the paired-compar- 
ison scale median. Here, 43 of the 53 subjects 
so ranked their greatest GSR groups. The bi- 
nomial test (4, p. 41) result was: z — —441, 
p < 90003. 


3. The Third Test 


"This test was the same as the second except 
that instead of the single greatest GSR, the two 
greatest were here identified for each subject. 
It was predicted that the subject would rank 
both of these groups below the paired-compari- 
son scale median. In this instance 41 of the 53 
subjects responded as predicted: z =—3.85, 
p < .00007. 


4, The Fourth Test 

The 9 stimulus groups were ranked for the 
sample according to GSR magnitude and ac- 
cording to paired-comparison position. It was 
predicted that a tantial correlation between 
GSR magnitudes and paired-comparison rank 
positions would obtain. The Spearman rank co- 
efficient was .82. 


E. SUMMARY AND CONCLUSIONS 


This study was the third in a series designed 
to test the prevalently held view that when prej- 
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udicial-attitudes are brought into play they are 
affectively supported. The first two studies (2, 3) 
provided evidence in support of this view. 
These studies showed a consistent relationship 
between prejudicial-attitude strength and affec- 
tivity level. Subjects used in these first two 
studies were only those who had already indi- 
cated relatively strong prejudicial-attitudes to- 
ward particular social groups by way of rating 
and ranking scale performance. 

The present study, the third, was designed as 
a “check” on the first two, and differed from 
them in two principal ways. First, the sample 
was random, in the sense that information with 
reference to subjects’ prejudicial-attitudes had 
nothing to do with their selection. Second, 
GSRs were recorded first and paired-comparison 
scales were administered later. In the latter re- 
spect, the third study was a reversal of the pro- 
cedure used in the first two. 

In individual experimental sessions, 9 com- 
plimentary statements were read to subjects. 
Each statement referred to an ethnic or 
national group. The GSR was automatically re- 
corded for each statement. At a later time sub- 
jects completed a paired-comparison scale which 
ranked the groups with reference to preference. 

It was predicted that relatively great affec- 
tivity (as measured by relatively great GSR) 
would be indicative of relatively great anti- 
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21. The AB Scales: An Operational Definition of Belief 


and Attitude 


MARTIN FISHBEIN and BERTRAM H. RAVEN 


Many social psychologists have seen fit to dis- 
tinguish between “belief” and “attitude.” Gen- 
erally, cognitive aspects have been attributed to 
“beliefs” and affective or motivational aspects 
have been attributed to “attitudes” (Allport, 
1954; Campbell, 1961; Katz and Stotland, 
1959; Krech and Crutchfield, 1948), This paper 
attempts further to clarify these two concepts 
by providing, what we consider to be, a much- 
needed operational distinction between “belief” 
and “attitude.” 

In a study stimulated by Festinger’s (1957) 
theory of cognitive dissonance, it appeared that 
an individual's conceptions of extra-sensory per- 
ception could be affected along several dimen- 
sions (Raven and Fishbein, 1961). Pressures 
could be exerted upon the S’s belief in ESP— 
"What is the probability that such a phenom- 
enon does exist?" "Is it existent or non-exis- 
tent?"—or upon his evaluation of ESP—"Is ESP 
'good' or 'bad'?" The latter, consistent with Os- 
good et al. (Osgood, 1952; Osgood and Tannen- 
baum, 1955; Osgood, Suci, and Tannenbaum, 
1957; Doob, 1947; and Rhine, 1958), can be 
seen as the evaluative dimension of a concept, 
or more specifically, as an “attitude.” The for- 
mer dimension could then be considered as the 
probability dimension of a concept, or more 
specifically, as a “belief.” Having arrived at 
these definitions, it was necessary to determine 
whether the two dimensions could be measured 
and manipulated independently. 


E 
CONSTRUCTION OF THE INSTRUMENT 


Fortunately, Osgood et al. (1957) had already 
demonstrated that a reliable and valid measure 


. Reprinted from Human Relations, 1962, 15, 35- 
44, with permission of the publisher. This study 
was supported by the Group Psychology Branch of 


of attitude, i.e., the evaluative dimension, could 
be obtained by having subjects check positions 
on the series of seven-place bipolar adjective 
scales, e.g., "good-bad," "clean-dirty," et cetera. 
Since the reliability of the semantic differential 
had already been demonstrated, it seemed that 
the same technique could be used to measure 
the probability dimension. Thus the instrument 
utilized in measuring both dimensions was a 
modified form of Osgood's semantic differential. 

The adjective pairs making up the final in- 
strument were determined empirically by asking 
subjects to take prescribed orientations in re- 
sponding to an adjective-rating task. Twenty-two 
pairs of polar adjectives, representing the two 
dimensions, were selected on a priori grounds. 
These adjectives were given to fifteen graduate 
psychology students, randomly assigned to four 
groups. These respondents were asked to rate 
the concept “ESP” on the polar adjective scales. 

Members of Group I were told to rate the 
concept (ESP) the way they thought it would 
be rated by someone who believed that extra- 
sensory perception existed, and who felt that 
ESP was a good thing. Members of Group II 
were told to give their ratings the way a person 
with a strong negative belief and a positive atti- 
tude would rate the concept; Group III re- 
ceived instructions to estimate the ratings of an 
individual with a strong positive belief and 
negative attitude; while Group IV was told to 
give ratings for x individual with strong nega- 
tive belief and negative attitude. 

In this case, as in the others reported herein 
where semantic differential ratings were used, 
Ss were given the standard instructions for 
using the semantic differential form (Osgood, 
Suci, and Tannenbaum, 1957). As in Osgood's 
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(Concept to be rated is inserted here) 


3 rational $ i intuitive 
MGR Rae i ^ SRL RES. ^ benefücial 
A wise TER Vengo n. Br Modlih 
EE NV. Dux CU P: dan 
successful E t ] ^ : ‘ae unsuccessful 

LEE AL . .  .. M . poni 
educated ip iMi nA dx ner ignorant 
cruel 0c uhi MER UMS 
graceful EE xn Tom awkward 
potent ST ako ibm ad TTE impotent 

(he MMC NM: ME 
active L1 e aie aN A Nato passive 

B existent ms NrW ar MEA) g nonexistent 

A bad eee i A ON ee 

B probable Sec SD. GÉ QU cuo Ee RU improbable 
skeptical = Le ite ors ciao believing 

oso) SS n co MR Tre Th likely 
honest BENE ues ees "MEUSE dishonest 

A sick EN UWaM 1L que Spa cai 
strong ia MRM sal weak 


Figure 1. The AB Scales. The scales measuring attitude are marked with A in 
the left column; those measuring belief are marked B. The direction of scoring 
is also indicated with +3 and —3. None of these was indicated on the forms 


actually given to the subjects. 


form, each scale had three degrees of positive, 
three degrees of negative, and a neutral point! 
(see Figure 1). 

In analyzing the data, adjective pairs that did 
not clearly distinguish between belief and atti- 
tude were eliminated. That is, if a particular 
scale was scored positive in Groups I and II 
(favorable attitude groups) and negative in 
Groups III and IV (unfavorable attitude 
groups), then it was considered a good attitude 
scale. Similarly, a scale receiving positive scores 


1 Actually, in scoring the scales, it is easier to assi 
values from “1” to “7,” with «4" n 


from the total score. We have presented our tables 
in the latter form, since it more clearly represents 
the directional aspects of beliefs and attitudes. 


in Groups I and III (positive belief groups), 
and negative scores in Groups II and IV (nega- 
tive belief groups), was considered a good belief 
scale. Those scales that yielded positive or nega- 
tive scores in three or more groups were dis- 
carded. 

With the scales thus partially refined, the 
procedure was repeated with a new sample of 
19 undergraduate psychology students, five Ss 
each in Groups I, II, and III, and four Ss in 
Group IV. Again, nondiscriminating items were 
discarded. > 

We were now left with the final instrument, 
which we shall call the AB Scales (see Figure 1). 
This instrument consists of five belief items 
(probable-improbable; possible-impossible; 
likely-unlikely; existent-nonexistent; and true- 
false), five attitude items (good-bad; sick-healthy: 
harmful-beneficial; wise-foolish; and clean-dirty), 
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and ten filler items which serve somewhat to 
disguise the purpose of the instrument. 

As an additional test, the instrument was 
given to another sample of twenty undergradu- 
ate Ss, five in each of the four role-play condi- 
tions. With five adjective pairs each, attitudes 
and beliefs could both vary between —15 and 
+15. Table 1 indicates that the scales did in- 
deed differentiate quite clearly between the four 
role-play conditions. 


TABLE 1. Mean Attitude and Belief Scores 


for Role-Play Groups* 
Group Group Group Group 
Scale I I IH IV 
Attitude +4102 411.2  —122 —13.0 
Belief 4150  —142 4118  —150 


* Scores could range from —15 to +15. The 
four groups were asked to role-play respectively: 
I. High belief, positive attitude; II. Low belief, 
positive attitude; III. High belief, negative at- 
titude; IV. Low belief, negative attitude. There 
were five Ss in each group. 


Needless to say, the role-play conditions could 
not be considered as a reasonable test of the 
reliability or validity of our scales. Though the 
role-playing served as a useful tool in the selec- 
tion of scale items, the actual tests of reliability 
and validity were conducted in non-role-playing 
situations. It should be pointed out that other 
adjective pairs might also have been used to 
differentiate between belief and attitude. For 
example, in the first undergraduate sample, 
"high-low" and "kind-cruel" were also found to 
be good attitude scales. Since we had arbitrarily 
decided to limit the scale to five adjective pairs 
for each dimension, we selected those which 
seemed most sensitive, while recognizing the 
possibility of including a greater number of ad- 
jective pairs for even greater precision. 


RELIABILITY, GENERALIZABILITY, AND 
INTER-ITEM CORRELATIONS 


Although the reliability of Osgood's semantic 
differential had already been demonstrated, it 
was felt that a specific test of the reliability of 
these scales was needed. Forty-three Ss rated 
the concept ESP on the AB Scales during one 
of their regular class sessions, and re-rated the 
concept after a four-day delay, in the same cir- 
cumstances. For these 43 subjects, the correla- 
tion between attitude scores was .900, while that 


for belief scores was .908. Both correlations are 
significant well beyond the .01 level. 

To test whether the independence of belief 
and attitude was specific to ESP, we presented 
the AB Scales to a fifth sample of 121 subjects. 
All Ss were asked to rate the concepts “extra- 
sensory perception," “atomic fallout," and “ra- 
cial prejudice." The correlations between belief 
and attitude for the concepts were —.168, 
—4069, and +.120 respectively. Thus it appears 
that the independence of the belief and attitude 
scales is not specific to ESP. It remains for fur- 
ther empirical investigations to determine how 
general such independence is in relation to 
other concepts, but it appears unlikely that be- 
lief in the existence of any object should corre- 
late with the attitude toward it. 

Since Osgood had empirically determined the 
evaluative dimension through factoranalytic 
techniques, there was little question about the 
inter-correlations of the adjective pairs compris- 
ing the A Scale. However, the B Scale was de- 
termined through a role-play procedure, and 
certain of Osgood’s evaluative adjective pairs 
were eliminated from the A Scale because they 
failed to discriminate between the two dimen- 
sions. It therefore seemed necessary to provide 
inter-item correlations for both the A and the 
B Scales. 

Following the technique of Osgood, Suci, and 
Tannenbaum (1957) for obtaining an inter- 
correlation matrix of adjective pairs for more 
than one concept, correlations were obtained 
for 53 subjects’ ratings of the concepts "extra- 
sensory perception," “atomic fallout,” and “ra- 
cial prejudice." The inter-correlations of the 
adjective pairs comprising the A Scale were all 
significant at less than the .01 level, with the 
range being from .638 to .858. Similarly, all the 
inter-correlations of the adjective pairs compris- 
ing the B Scale were significant at less than the 
.01 level, the range being from .399 to .798. 
Inter-correlations between adjective pairs of the 
A and B Scales were generally nonsignificant, 
although a few significant negative correlations 
were obtained. 


A VALIDITY TEST OF THE AB SCALES 


To test the validity of the AB Scales, an at- 
tempt was made selectively to alter subjects’ 
beliefs or attitudes through the method of dif- 
ferential communications. Four different arti- 
cles approximately one page in length were 
used. These were presented as excerpts from 
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actual articles. A different article was given to 
each of four groups. The groups and commu- 
nications were: 


l. Positive belief group—the communication 
stressed phenomena that could be "explained" 
only by ESP. 

2. Negative belief group—the communica- 
tion stressed the lack of controls and poor ex- 
perimental design in ESP experiments. 

3. Positive attitude group—the communica- 
tion stressed the values of ESP. 

4. Negative attitude group—the communica- 
tion stressed the dangers of ESP. 


In composing the articles, care was taken to 
avoid indication by the author of his own belief 
or disbelief in ESP, in the attitude-change com- 
munications. Similarly, indication of attitude 
in the belief-change communications was 
avoided. There were 41 introductory psychology 
students in the study of belief change and 36 
introductory psychology students in the attitude- 
change study. 

In the two experiments identical procedures 
were used. After being given Osgood's standard 
instructions for using the semantic differential, 
the subjects rated the concept "extra-sensory 
perception" on the AB Scales. The subjects in 
each experiment were then randomly divided 
into two groups, with one group receiving the 


positive article (for belief in Experiment I or 
attitude in Experiment II) and the other group 
receiving the negative article. After reading the 
material, all Ss once again rated the concept 
“extra-sensory perception” on the AB Scales. 
The actual changes in response to differential 
communications should yield more acceptable 
evidence of the validity of the scales. 

From Table 2 it can be seen that attitudes 
were changed differentially as a function of 
communications designed to change attitudes 
(x? = 6.18, p = .02). Similarly, the belief com- 
munications had differential effects on beliefs 
(x? = 10.70, p = .01). The table indicates that 
there was also some change in beliefs as a func- 
tion of attitude communication, and some 
change in attitudes as a function of belief com- 
munication, but neither of these differences was 
significant. 

Another set of findings was observed in Ex- 
periment I, the groups which received the be- 
lief communication. The results were: (a) the 
pretest means and standard deviations of the 
two groups vis-à-vis both belief and attitude 
were not significantly different from one an- 
other; (b) however, there was a positive correla- 
tion of .49 (significant at the .05 level) between 
belief and attitude in the negative group prior 
to the communication; (c) the correlation was 
maintained after the communication; and (d) 


TABLE 2. Number of Subjects Changing Beliefs and Attitudes on AB Scales 
Following Communications 


Frequency Change 


Frequency Change 


in Belief in Attitude 
_ Type of Posi- No Nega- Posi- No Nega- 
Communication tive Change tive tive Change tive 


A. Communications Designed to Change Beliefs 


Positive 18 2 4 6 4 9 
Negative 4 1 17 2 4 16 
x? (1 df) = 10.695* x? (1 df) = 1.77* 
p=.0l pns. 
B. Communications Designed to Change Attitudes 
Positive 8 5 5 9 1 8 
Negative 4 3 11 1 1 16 
x? (1 df) = 2.81* x (1 df) = 6.13* 
p= ns. p=.02 


———————————————D 


* "Positive" and "no change" cells combined. Chi-squares were corrected for 
continuity. P-values are the same when "negative" and "no change" cells are 


combined. 


i 
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therefore, in addition to obtaining a significant 
belief change, there was also a significant attitude 
change in the negative group (p —.008, two- 
tailed sign test). Thus, though the above data 
demonstrated that it is possible independently 
to manipulate beliefs and attitudes, as here de- 
fined, and that one can obtain non-correlated 
measures, it seems that there are some situations 
in which the two may be correlated. This is of 
much theoretical interest in its own right, and 
should be investigated further. 


DISCUSSION 


If our independent manipulation of belief 
and attitude seems inconsistent with other 
treatments of these concepts in the literature 
(e.g, Campbell, 1950; Campbell, 1961; Carl- 
son, 1956; Newcomb, 1953; Rosenberg, 1958, 
1960), it would be well to keep in mind the 
specific definitions of "belief" and "attitude" 
which have been utilized thus far in this paper. 
In prior discussions, "attitude" was often de- 
fined as including "belief." For example, Krech 
and Crutchfield (1948) define belief as "an en- 
during organization of perceptions and cogni- 
tions about some aspect of the individual's 
world," and attitude as "an enduring organiza- 
tion of motivational, emotional, perceptual and 
cognitive processes with respect to some aspect 
of the individual's world." Accordingly, any 
change in the cognitive aspect of some attitu- 
dinal object would, by their definition, entail 
à change in the attitude toward that object. 
They would also consider some beliefs which 
do not involve any evaluation. When defined 
as the probability dimension, a belief can 
change independently of an attitude. Further, 
two individuals may differ in belief but have 
similar attitudes, or vice versa. In addition, with 
respect to any object of belief, it is possible to 
obtain a measure of the evaluative dimension, 
or attitude, toward that object; such evaluation 
may vary from negative through zero to posi- 
tive. 

Perhaps it will appear that this definition of 
belief is a very specialized one. It seems reason- 
able to consider the degree of belief in the 
existence of ESP, God, life after death, grem- 
lins, etc, but with respect to other objects, 
practically everyone would agree that the ob- 
jects exist. Yet, here too, there might be dis- 
agreement as to the extent of existence. For 
example, there is no reason to assume that a 
person living in a totalitarian state will believe 


as strongly in the existence of “the church” as 
individuals living in democratic societies. Simi- 
larly, people living in all-white communities 
may believe less strongly in the existence of 
“Negroes” than people who have had personal 
experience with Negroes. 

The definition of belief thus far considers 
only the probability of the existence of an ob- 
ject. Of course, any object that exists must exist 
in some fashion. In understanding beliefs, 
should we be content simply to measure the 
perceived probability of existence, or should 
we not also be concerned with the precise na- 
ture of that existence? Such a question compels 
us to suggest a distinction between belief in an 
object, and belief about an object. Thus far, we 
have defined belief in, or more completely, be- 
lief in the existence of, an object. One could 
also consider belief in the existence of a rela- 
tionship between that object and some other 
object, or some quality. Having measured belief 
in the existence of ESP, one might also be con- 
cerned with belief in the existence of rela- 
tionships involving ESP, such as “ESP is 
spiritualistic,” “ESP is a type of communication," 
“ESP leads to thought control,” “ESP is bene- 
ficial.” Any of these statements could be placed 
above the AB Scales and the belief in the exis- 
tence of that particular relationship could be 
measured. The various beliefs in the relation- 
ships between an object and other objects or 
qualities would then be defined as beliefs about 
that object. While belief in refers to the existence 
of an object, belief about deals with the nature 
of that object, the manner in which it exists. 
It is obvious that individuals can thus agree in 
their belief in an object, but differ in their be- 
liefs about that object, i.e., in their estimation 
of the various qualities and objects which might 
be associated with a given object. The various 
relationships involved in belief about an object 
are innumerable: "is part of,” “has the charac- 
teristic of," “is opposed to," “leads to,” “is the 
opposite of,” et cetera. Many individuals who 
agree that ESP exists may disagree as to whether 
it is “powerful,” “mystical,” or “insidious.” Reli- 
gious controversy through the ages has obtained 
among individuals who were agreed as to their 
belief in God, but who were not agreed in their 
beliefs about him—as to whether or not he was 
“omnipotent,” “omniscient,” “vengeful,” “had 
a Son,” et cetera. The AB Scale which was used 
earlier to measure belief in an object could 
similarly be used to measure belief about an 
object, since the significant relational statements 


188 ATTITUDE MEASUREMENT 


could also be measured on the belief scale. 
When Carlson (1956) introduced information 
which led subjects to associate desegregation in 
housing with changes in our international rela- 
tions, he was, in effect, manipulating belief 
about segregation, and probably also belief 
about our international relations. 

How, then, could one account for Carlson's 
finding that, for moderately prejudiced subjects, 
attitude toward segregated housing changed 
when he presented information indicating that 
desegregation would lead to improved American 
prestige abroad, and that such action was asso- 
ciated with “equal opportunities for all" and 
"broad-mindedness"? We would consider that 
he had altered belief in a number of relation- 
ships involving desegregation, thus he had al- 
tered beliefs about desegregation. Furthermore, 
the subjects had positive attitudes toward the 
various qualities and objects to which desegre- 
gation was related. From various theories of 
imbalance or dissonance (Festinger, 1957; 
Heider, 1958; Newcomb, 1953; Osgood, Suci, 
and Tannenbaum, 1957; Osgood, 1960; Rosen- 
berg, 1960; Zajonc, 1960), one would expect a 
change in attitude which would restore balance. 
Heider might diagram the relationship as 
shown in Figure 2. 

In these diagrams, p represents the subject; 
x represents “desegregation”; y, “American 
prestige abroad”; the arrows from p to x and y 
indicate “liking” or attitudinal relationships; 
the arrow from x to y indicates a unit-forming 
relationship. In our system, the lines px and py 
represent attitudes of p toward x and y respec- 
tively; the line xy represents a belief in the 
existence of a relationship between x and y—a 
belief about x. 

From this system, it appears that balance 
could also have been restored by changing the 
attitude toward "American prestige abroad" to 
negative. Further, another alternative for re- 
storing balance is for S not to believe in the 


Pre-communication 


Imbalance just after 


existence of the relationship between “desegre- 
gation" and "American prestige abroad." That 
is, the xy relationship might have been changed 
to a negative one. This is quite likely what 
happened with many of Carlson's subjects who 
had strong negative attitudes toward desegrega- 
tion and who did not change their attitudes as 
a result of Carlson's operations. Hypothetically, 
the imbalance situation could not occur if S 
did not believe in the existence of either “de- 
segregation" or "American foreign policy." 

With our scale system, these changes could 
be readily checked by (a) placing "desegrega- 
tion" at the top of the scales, and measuring 
attitude toward and belief in “desegregation”; 
(b) placing “American prestige abroad" above 
the scales and measuring attitudes toward and 
belief in "American prestige abroad"; and (c) 
placing the phrase "desegregation improves 
American prestige abroad" above the scales and 
measuring the belief in the existence of this 
relationship. In addition, a measure of the atti- 
tude toward this last relationship may also be 
obtained. 

Following these considerations, Fishbein 
(1961) has recently investigated the functional 
relationships between attitude toward an object, 
attitudes toward other concepts, and the beliefs 
about the object, i.e., the beliefs in the exis- 
tence of relationships between the object and 
these concepts. The data are supportive of the 
conceptualization discussed above. 


SUMMARY 


In this paper a theoretical and operational 
basis for distinguishing "belief" and "attitude" 
has been suggested. “Belief” was defined as the 
probability dimension of a concept— "Is its ex- 
istence probable or improbable?" "Attitude" 
was defined as the evaluative dimension of a 
concept—“Is it good or bad?" A set of semantic 
differential scales was selected through role- 
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communication 
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playing procedures which yielded a set of five 
polar adjectives that measure belief, and five 
that measure attitude. Tests with several con- 
cepts showed no correlation between ratings of 
objects on these two sets of scales. Communica- 
tions were presented to different groups of Ss 
which appeared to affect “attitude” and “belief” 
independently. In these studies, the reliability 
and validity of the distinction were established. 


Later discussion distinguished between “belief 
in" a concept and “belief about" a concept, the 
latter being defined as belief in the existence of 
a number of relationships between the concept 
and other concepts. It was shown that a change 
in attitude toward a particular concept could 
result from a change in belief about that con- 
cept. Techniques for testing such influence with 
the new scales were suggested. 
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22. The Own Categories Procedure in Attitude Research 


MUZAFER SHERIF and CAROLYN WOOD SHERIF 


The Own Categories Procedure is a general 
method for the study of social attitudes which 
avoids certain limitations inherent in many 
available models for attitude measurement and 
provides more information about the indi- 
vidual's attitude than most of them. The pro- 
cedure yields quantitative indicators of the 
individual's attitude and the degree of its im- 
portance to him, through the number of cate- 
gories he uses and the pattern of his judgments 
in the categories. 

In this paper, we shall first describe the pro- 
cedure, then summarize the theoretical basis 
for its use in attitude research, as briefly as 
possible, Next, research findings will be de- 
scribed which indicate some major variables 
affecting the measures yielded by the proce- 
dure. Finally, the utility of the procedure in 
attitude research will be discussed briefly. 


DESCRIPTION OF PROCEDURE 


The phrase "Own Categories" reflects the 
translation of the conception of attitude into 
operational procedures. Attitude is conceived 
as a set of evaluative categories which the indi- 
vidual has formed (or learned) during his in- 
teraction with persons and objects in his social 
world. When faced with persons, objects, or 
events relevant to his attitude, the individual 
uses these categories for classifying specific 
items as acceptable-unacceptable, good-bad, 
truthful-erroneous, or other appropriate evalu- 
ative terms. In this sense, they are his "own 
categories" for evaluation, as contrasted with 
the division of a stimulus domain under labels 
arbitrarily imposed by an authority figure, an 


* This paper was prepared for the Symposium on 
Attitude Measurement and Attitude Change at the 
International Congress of Applied Psychology, 
Ljubljana, Jugoslavia, 1964. It is used with the per- 
mission of the senior author. 


experimenter, or a survey interviewer. They 
are his personal reference scale for appraising 
the stimulus domain in question. 

To the extent that the object of the attitude 
is socially relevant, individuals belonging to 
the same group or culture will share rather 
similar attitudes, hence each individual's own 
categories are not necessarily unique to him. 
The bounds of their acceptances will vary 
within a range defining the “proper,” “the ac- 
ceptable,” and the “desirable” way of viewing 
the stimulus domain and behaving toward it in 
their group or society. This range of positions 
defined as acceptable, plus a range of positions 
defining what is unacceptable in a group or 
society represents one social norm in their set 
of values or norms, which is an important as- 
pect of its culture. 

The Own Categories Procedure starts with 
preparation of a set of items relevant to an 
attitude. These may be objects, pictures, or 
verbal statements about some topic or issue. 
The content of the items may vary, however, 
the logic of the procedures need not change 
with the content. Major criteria in assembling 
the pool of items will be specified later. In 
research thus far, the number of items has 
varied from 25 to over one hundred. 

The individual's task is simple and natural 
in any culture where sorting objects is a famil- 
iar task. He is instructed to sort the pool of 
items into any number of piles, or categories, 
that seem to him necessary, so that items within 
each category seem to him to “belong together. 
The dimension or attribute which he is to 
judge is specified clearly. For some research 
purposes, he is told to categorize them in terms 
of their acceptability to him. 

If, however, the aim is a “disguised” or 
direct” test of his attitudes, instructions are 
used Which do not arouse his awareness that 
his stand is under the scrutiny of a stranger, oF 


im 
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researcher. Then, he is told to place the items 
into categories as objectively as possible in 
terms of how “favorable” or “unfavorable” they 
are toward the persons, events, issues, or objects 
in question. For example, he may be told to 
categorize statements concerning Negro citizens 
of the United States in terms of their “favor- 
ableness” to the status of the Negro. One ex- 
treme category is specified in instructions, e.g., 
“Put the statements most unfavorable to Ne- 
groes in the first pile.” Otherwise, utmost cau- 
tion is used to avoid hints as to the number of 
categories to be used, and the individual is free 
to distribute the items into the categories in 
any way he chooses. 

When his task is completed to his satisfac- 
tion, the individual is usually asked to order 
his categories. He may be asked to label or de- 
scribe them, either in terms of the explicit 
criterion stated in instructions or in terms of 
their acceptability-unacceptability to him per- 
sonally. These variations depend upon the re- 
search purpose and hypotheses. 


THEORETICAL BASIS OF THE PROCEDURE 


The present approach started with the 
search for reference scales which are psycholog- 
ically and socially valid for the study of attitude 
and attitude change. In an unpublished study 
in 1948 (Sherif, Volkart, and Hovland), we 
found that the bounds of the positions accepted 
and rejected on a social issue vary, even for 
persons upholding the same position as most 
acceptable. A series of studies on a variety of 
social issues over the last 15 years has confirmed 
the early finding, namely, that the ranges of the 
positions that an individual accepts, rejects, and 
toward which he remains noncommittal (when 
not required to evaluate every position) vary 
systematically according to his personal involve- 
ment in the issue (Sherif and Hovland, 1961; 
Sherif, Sherif, and Nebergall, 1965). There isa 
high probability that the individual who en- 
dorses an extreme position will rank that issue 
high in importance, within his scheme of per- 
sonal priorities. But those adopting a moderate 
position with equal ardor display patterns of 
acceptance-rejection-noncommitment similar to 
their more extreme counterparts. 

Specifically, we found that individuals highly 
involved in some stand on a social issue reject 
many more positions than they accept, and 
readily evaluate almost every position as accept- 
able or objectionable. (Their latitudes of rejec- 


tion are approximately twice the size of their 
latitudes of acceptance, and noncommitment 
approaches zero.) Proportional to his lack of in- 
volvement, the number of positions the individ- 
ual accepts and rejects become approximately 
equal and his latitude of noncommitment 
increases. This means that highly involved 
persons have a much broader latitude of rejec- 
tion than persons less concerned, and that they 
remain noncommittal toward fewer positions, 
even when not required to evaluate all of them. 

What psychological principles underlie these 
differences in the size of the individual's lati- 
tudes for acceptance, rejection, and noncommit- 
ment? We had no factual basis for positing a 
typology of individuals as to the range of their 
tolerance, nor for unmeasurable “psycho- 
dynamic mechanisms” which would account for 
the different patterns. We still have no such 
evidence. We sought a theoretical basis for the 
phenomena sufficiently broad to encompass the 
considerable factual evidence that a particular 
individual may tolerate differences on issues of 
little concern to him—to the point of self- 
contradiction—while seeing a matter of high 
personal concern in terms of "black and white.” 
There is also evidence that the ranges for tol- 
eration and condemnation vary from one hu- 
man group to another. 

We found such a basis in the psychology of 
judgment, studied for decades in psychological 
laboratories in various countries. Our account 
of the principles essential to the present ap- 
proach is necessarily brief and sketchy (cf. 
Sherif and Sherif, 1956; Sherif and Hovland, 
1961; Sherif, Sherif, and Nebergall, 1965). 
Whether the object of judgment is a weight, 
the length of a line, the color of a person's 
skin, or the beauty of a girl, judgment is ren- 
dered relative to the immediate stimulus con- 
text in which it appears and to preceding 
contexts. 

For example, quite apart from one's attitude 
on the issue of the segregation of public schools 
according to skin color or national origin, a 
statement such as “We must keep the future 
interests of school children in mind” is ap- 
praised differently when preceded by other 
statements opposed to segregation, on the one 
hand, or favoring segregation, on the other. 

But not all the stimuli present or just pre- 
ceding a judgment have equal weight in affect- 
ing the outcome, even in the psychophysical 
laboratory where the stimuli are motivationally 
neutral. In the orthodox psychophysical meth- 
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ods, the standard stimulus presented with each 
new stimulus for comparison is more influential 
than others less frequently presented, or not 
designated as a standard (Helson, 1959, p. 591). 
Such frequently presented stimuli, or those des- 
ignated as standards, become anchors for the 
individual's judgments. Lacking an explicit 
standard, the individual typically uses the most 
extreme stimuli presented to him as anchors in 
sizing up the intermediate values; and the end 
stimuli thus contribute more than others to his 
judgment of a particular member of the series 
(Parducci, 1963). 

An anchor stimulus or a designated standard 
enhances the accuracy of judgment for items 
coinciding with it in value; but it produces 
systematic shifts or "displacements" in judg- 
ments of objects differing from it in varying 
degrees. An anchor differing slightly from the 
object of judgment results in displacement 
toward the anchor: this is an assimilation ef- 
fect, well known in studies of perception and 
now thoroughly established in judgment (cf. 
Sherif, Taub, and Hovland, 1958; Parducci 
and Marshall, 1962; Helson, 1964). With in- 
creasing discrepancies between the anchor and 
the object of judgment, assimilation ceases and 
displacement begins to occur in a direction 
away from the anchor: the difference between 
the anchor and the object is exaggerated. This 
is the well-known contrast effect. Assimilation 
and contrast in judgment are complementary 
phenomena governed by the relationships be- 
tween anchor and object of judgment. 

In 1950, a series of studies was initiated to 
clarify the differences and similarities between 
judgments of neutral and socially relevant 
items, ranging from experiments with lifted 
weights to studies of the scale values of atti- 
tudinal statements. On this basis, a conceptual 
approach was developed which does account 
for the distinctive patterning of latitudes of 
acceptance-rejection and which provides the 
theoretical basis of the Own Categories Proce- 
dure. The approach is based on the relation- 
ships governing assimilation-contrast effects, 
taking into account the following generaliza- 
tions: 


1. If a person has an attitude toward the 
stimulus domain, his judgments of specific ob- 
jects in that domain are, to some extent, rela- 
tive to the categories of his own reference 
scale, in addition to the context of immediate 
and preceding stimulation. 


2. To the extent that the domain has high 
priority in his scheme of personal relatedness 
with his social world, his latitude of acceptance 
becomes an anchor or standard for his place- 
ment of other items in the domain. In other 
words, the range of positions acceptable to him 
becomes an anchor proportional to his personal 
involvement in upholding it. 

3. To the extent that his own position be- 
comes the most salient anchor in the situation, 
the individual's categorization of items is eval- 
uation of the items. There is now a considerable 
body of research supporting this statement 
(Sherif, Sherif, and Nebergall, 1965). Even 
when instructed to heed only the stimulus at- 
tributes of the items and categorize them on 
an impersonal dimension, the highly involved 
individual performs the task in terms of his 
agreements-disagreements with the items, in- 
cluding his assessments of their truth and 
falsity. One may temporarily force him to fol- 
low instructions by insisting that he compare 
one item with another, as in the method of 
paired comparisons. He can discriminate 
among the items, but barring special arrange- 
ments requiring him to do so, he simply does 
not divorce the task of judging the items from 
his evaluations of them. 

4. When his own stand is the anchor, other 
items will be displaced toward his acceptable 
category (assimilation) or away from it (con- 
trast), proportional to their proximity or dif- 
ference from his own stand. The additional, 
necessary condition for these systematic dis- 
placements is that the objects lack, in some 
degree, objective properties which are readily 
and uniformly perceived as defining member- 
ship in a particular category. For example, 
strongly worded statements of an extreme posi- 
tion on a social issue are not displaced system- 
atically to any significant degree. They are 
readily identified as extreme positions in terms 
of the prevailing social realities. Systematic dis- 
placements are found, however, for less extreme 
and intermediate positions on the same issues. 


With these four additions, the theoretical 
approach accounts for the phenomena which 
are used to study attitudes with the Own Cate 
gories Procedure. 


RESEARCH USING THE OWN CATEGORIES 
PROCEDURE 


Using this conceptualization, Sherif and 
Hovland (1953) predicted that individuals 
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strongly involved in problems of the status of 
Negro citizens would (a) use fewer categories 
for judging statements on the issue than less 
involved individuals, and (b) place fewer state- 
ments in the category of items most acceptable 
to them than in the category most objection- 
able to them. Highly involved Negro students 
(who were the first admitted to a previously 
white university), active white participants in 
the desegregation movement, Negroes at a 
segregated university, a few consistently anti- 
Negro whites, and unselected white students 
were the subjects. They sorted 114 statements 
prepared by Hinckley as the preliminary pool 
for item selection for a Thurstone attitude 
scale. The pool contained pro-Negro and anti- 
Negro statements with a large number of inter- 
mediate items having high Q values (high 
variability). The statements were sorted by the 
Own Categories Procedure, either two weeks 
before or two weeks after the same subjects 
judged them using 11 categories as prescribed 
in the Thurstone “equal-appearing intervals” 
method (Hovland and Sherif, 1952). 

They found that highly involved persons, 
judging these statements in terms of their 
favorableness-unfavorableness to Negroes, pro- 
duced bimodal distributions of the statements 
into categories: intermediate categories were 
neglected, while a disproportionate number of 
items was placed in the extreme category far- 
thest removed from the position the individual 
found acceptable. The bimodal distribution 
was greatly accentuated under the Own Cate- 
gories Procedure owing to the significant ten- 
dency for highly involved persons to use fewer 
categories. In fact, the most militant Negro 
subjects used, on the average, fewer than four 
categories, placed 65 of the 114 statements in 
a single category highly objectionable to them 
and 27 in a category acceptable to them. In 
comparison, unselected white students placed 
43 statements in objectionable categories, and 
38 in categories they later indicated were 
acceptable, 

Using a combined index of skewness of the 
judgment distribution and number of cate- 
gories, Sherif and Hovland found that these 
two measures differentiated between their sub- 
ject groups according to their personal involve- 
ment in the problem of the status of Negroes. 
Proportional to their involvement, their 
threshold for acceptance was raised, as indi- 
cated by a smaller mode in the distribution of 
judgments in acceptable categories. Their 


threshold of rejection was lowered, as indi- 
cated by the disproportionately high mode of 
the distribution at categories they later said 
were objectionable. These peaks in the distri- 
bution reveal the individual's acceptable and 
objectionable categories, the objectionable cate- 
gory being wider. 


Latitude of Acceptance as Anchor 
in Evaluation : 


Carolyn Sherif (1961) compared the evalua- 
tive and nonevaluative categories of American 
Indian and white high school students of com- 
parable educational level. Since the ranges of 
acceptable items varied according to their cul- 
tural backgrounds in several respects, it was 
possible to show contrasting evaluations of the 
same items by the Indian and white students 
according to their latitudes of acceptance, 
while demonstrating that their categorizations 
of neutral items did not differ. 

Using the Own Categories Procedure, each 
individual sorted four sets of items. The Indian 
and white students categorized series of digits 
(numbers) in comparable fashion, attempting 
to fit the items into equal categories according 
to the decimal system. Their categories for 
evaluating the same digits when preceded by 
dollar signs differed significantly from these; 
and differences between the Indians and whites 
were significant. 

Having previously conducted a cultural sur- 
vey, Sherif (1963) knew that the prevailing 
bounds for acceptable expenditures of money 
differed. For example, reflecting an elaborate 
tradition of gift-giving in their culture, Indians 
considered much larger sums of money neces- 
sary for an appropriate gift than whites, but 
much smaller sums than whites were deemed 
acceptable for wearing apparel. When evaluat- 
ing the series of numbers as dollar values, their 
latitude of acceptance was the main anchor, 
with the result that their evaluative categories 
were highly unequal when the range of prices 
exceeded acceptable bounds. To the extent 
that the price range was beyond acceptable 
bounds, the discrepant items were lumped to- 
gether in one category as extremely objec- 
tionable. 

This experiment showed that the “relativity 
of judgment” is not solely in terms of the im- 
mediate and preceding stimulus context of 
judgment, but is also affected by the indi- 
vidual's own categories for what is acceptable 
and objectionable whenever evaluations are 
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rendered. For this reason, the Own Categories 
Procedure is suitable for assessing similarities 
and differences in reference scales prevailing 
in different cultures even when the objects of 
evaluation are not extremely involving to the 
individuals. 


Salience of Attitudinal Anchors Varies 
with Degree of Involvement 


C. Sherif's study also showed that the number 
of categories used varied systematically accord- 
ing to the individual's relative involvement 
with the four different sets of items. On the 
average, all subjects used more categories for 
the digits than they did for series of socially 
valued items. "The three sets of socially valued 
items were ranked in personal importance by 
à paired comparisons procedure. The number 
of categories used in evaluating them differed, 
in turn, according to their rank in personal 
importance. 

More categories were used for the monetary 
values than for descriptions of interpersonal 
behavior, and the fewest categories used for 
assessing the desirability of interpersonal con- 
tact with members of different ethnic groups. 
An intra-class correlation of .38 for the number 
of categories used by the same individuals for 
the four different materials was significant, but 
sufficiently low to indicate that most of the 
variance was accountable to the differing per- 
sonal relevance of the item content. 

A recent experiment by Glixman (1965) 
with different subjects and stimulus materials 
supports this conclusion. He reports that indi- 
viduals used fewer categories in appraising de- 
scriptive statements about themselves than in 
sorting familiar objects, even though a much 
larger number of the self-descriptions was pre- 
sented. He also reports correlation coefficients 
for individual consistency in number of cate- 
gories used for three item contents which vary 
around .35 for his different subject samples. 

It may be objected that significant differences 
in number of categories used by the same indi- 
viduals in categorizing sets of items which dif- 
fer in personal import are confounded by 
differences in the stimulus material itself —by 
the experimenter's incapacity to equate the 
range of the domains in question, and so on. 
The objection is appropriate. In this connection, 
a research thesis by John Reich (1963) directed 
by M. Sherif is pertinent. 

Reich studied the own categories of mature 
women between the ages of 35 and 50 with 


median years of schooling exceeding four years 
of university study. The attitude in question 
concerned legislative re-apportionment. Owing 
to the growth of urban areas, the number of 
representatives to the state legislature from dif- 
ferent districts no longer represented the rela- 
tive sizes of their populations. Reich secured 
active members of the League of Women Vot- 
ers, which had dedicated its major efforts to 
remedying this situation, as individuals highly 
involved in the issue. The less involved sub- 
jects were women school teachers, who also 
favored re-apportionment, but were not par- 
ticularly concerned with the issue, as indicated 
by lack of overt acts and expressions of 
opinion. The two samples were matched by age 
and education as closely as possible. 

It is interesting to note that persons who 
were opponents to re-apportionment changed 
their stands during the study. A Federal Court 
ruling, stating that the legislature must re- 
apportion, shifted his anti-reapportionment 
subjects to a pro-apportionment position ac- 
cording to an out-moded formula which no one 
had supported up to that time. We mention 
this change to emphasize the fact that the indi- 
vidual's own categories are inevitably tied, to 
some extent, to the reference scales prevailing 
in the social realities of his time and place. 
Similar shifts in the positions on psychosocial 
scales have been observed in the United States 
over the past century by groups favoring and 
opposing a federal income tax and, more re- 
cently, desegregation of public facilities. 

From a pool of 120 statements on the issue of 
re-apportionment, Reich chose 60 from pre- 
tests with 79 subjects. Fifteen were consistently 
judged as pro, 15 consistently as anti, and 30 
had been judged with high variability. In cate- 
gorizing these 60 statements by the Own Cate- 
gories Procedure, 74 per cent of the highly 
involved women used four or fewer categories, 
while only 26 per cent of the teachers used 
such a small number. The less involved teach- 
ers placed about the same number of sae 
ments in favorable and unfavorable categories, 
but the highly involved League members placed 
over half of the statements in unfavorable cate- 
gories, which they found extremely objection- 
able. 

The League members saw the issue in “black 
and white,” and mostly “black,” despite the 
fact that they were equally well educated and 
highly conversant with the complexities of leg- 
islative reapportionment. Because of their 
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superior information on the issue, they doubt- 
less could have made finer discriminations than 
the teachers. The fact is, that given the oppor- 
tunity to categorize the positions in their own 
way, they did not make fine discriminations. 


The Crucial Role of the Intermediate Items 


An earlier study by La Fave and Sherif (1962) 
provided convincing evidence that the distinc- 
tively bimodal distributions of judgments ob- 
tained by the Own Categories Procedure 
occurred through systematic displacements of 
the intermediate items. Twenty-five items on 
the issue of desegregation of public facilities 
were selected through pre-tests in order to have 
five representatives in each of the following 
classes: Very Integrationist, Integrationist, 
Moderate, Segregationist, and Very Segrega- 
tionist. Variability in placement of the items 
in the two extreme categories was low. Three 
samples judged the position of these 25 state- 
ments under the Own Categories instructions. 

The highly involved, favorable subjects were 
95 Negroes from a state Negro university in 
the southwest; the least involved were 144 un- 
selected white undergraduate students in a 
desegregated university. In an effort to secure 
highly involved, anti-Negro subjects, 79 mem- 
bers of a college fraternity with strong ties to 
Southern tradition were used. The fraternity 
members, however, were a heterogeneous lot, 
as noted in the next section. 

Owing to the small number of items (25), 
variations in number of categories used were 
less than in other studies previously cited. 
However, 92 per cent of the Negro subjects 
used four or fewer categories, and fully 68 per 
cent used three or fewer. In comparison, 46 
per cent of the unselected white students used 
five or more categories for the 25 items. Of 
the Southern Fraternity members, 77 per cent 
used four or fewer categories. 

Over 87 per cent of Negroes placed more 
items in categories which they labeled as favor- 
ing segregation than in those favoring desegre- 
gation. Among Southern fraternity members, 
the distribution was reversed by 59 per cent 
of the sample, that is, more judgments were 
made as favoring desegregation. Unselected 
white students distributed their judgments 
more evenly, 55 per cent of them placing equal 
numbers of items in favorable and unfavorable 
categories. 

Tracing down the items responsible for the 
bimodal distributions of most Negro and a ma- 


jority of Southern fraternity members, La Fave 
and Sherif were able to show that they occurred 
through systematic displacements of 13 state- 
ments which were intermediate to the extremes 
and had high Q values. The Negro subjects 
using fewer categories accumulated such state- 
ments in categories unfavorable to integration 
—a contrast effect relative to their own pro- 
integration stand. 'The Southern fraternity 
members, a majority of whom opposed inte- 
gration, accumulated such statements in cate- 
gories favoring integration—again a contrast 
effect. 

This research shows the importance of one 
of the major requirements in preparing a pool 
of items for the Own Categories Procedures: 
the array must include, between the extremes, 
a large number of items which are "inter- 
mediate" in position or are ambiguous in their 
position with respect to the extremes. Judg- 
ments of such items are characterized by great 
variability. They are, in short, items which are 
typically discarded in the construction of most 
conventional attitude tests as too variable. 

Studies by Zimbardo (1960) and La Fave 
et al. (1963) show that the characteristics of 
such items which make them subject to system- 
atic displacement according to the individual's 
attitude are not adequately described simply 
as "ambiguity," and that the inclusion of adjec- 
tives with positive or negative connotation re- 
duces variability in placement. The items which 
we have found most suitable are those repre- 
senting positions on an issue which are not 
well known, positions which are the object of 
wide disagreement (for example, “neutrality” 
on a "hot" intergroup or international issue), 
and statements whose meaning is "indeter- 
minate." An example of the latter is borrowed 
from Zimbardo's study (1960) of university 
students majoring in' science or in the humani- 
ties, namely: "Anyone who has known a scien- 
tist personally will know why science is where 
it is today." 

The second major requirement for prepar- 
ing a pool of items is that the range or variety 
should be large, including extreme positions on 
social issues or, in cross-cultural comparisons 
of reference scales, covering a range beyond 
usual experience and tradition. This require- 
ment reflects the fact that the particular range 
of values presented for judgment does affect 
the placement of particular items. 

For example, C. Sherif (1961) found that 
Indian and white students used narrower cate- 
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gories, thus discriminated more keenly, when 
presented a range of items which were largely 
acceptable to them than when objectionable 
items were also included. This corresponds to 
the well-known tendency to see fine shades of 
difference when dealing with friends or allies, 
but to ignore many of these differences when 
one's friends are compared with enemies. 


Comparison of Findings for Persons For 
and Against the Object of Attitude 


In the studies reviewed thus far, the use of 
few categories and the bimodal distributions 
of items in them have been shown most deci- 
sively for individuals who uphold stands in- 
tensely favorable toward the objects. The 
practical reason for this deserves comment. In 
selecting highly involved subjects, we have re- 
lied as much as possible on their known public 
actions and statements on the issue in question. 
Particularly in relation to the Negro issue, it 
has not been possible to obtain subjects who 
publicly state and act upon an anti-Negro 
position. 

In an effort to remedy this lack, we have 
used not one, but a combination of criteria 
for an anti position, including conventional 
paper-and-pencil tests of attitude toward the 
Negro, self-ratings, and observation of behavior. 
On the basis of these criteria, we found that 
only 41 of the 78 Southern fraternity members 
could be classified as consistently anti-Negro. 

The inconsistencies we have found were 
traceable to self-reports of attitude or paper- 
and-pencil tests. The explanation is not com- 
plex. There is a collegiate norm which prevails 
among students in all but a few places which 
makes an open stand against the Negro simply 
"not the thing to do." In their self-ratings and 
paper-and-pencil tests, aware that the researcher 
is interested. in their attitudes, many individ- 
uals respond in terms of the researcher's expec- 
tations, rather than in terms of their views— 
Which they subsequently belie through anti- 
Negro statements to friends and sympathizers. 
“Social desirability” (Edwards, 1957) of their 
Tesponses is the salient consideration in the 
research situation. Avoidance of this problem 
is a major advantage of the Own Categories 
Procedure, since it may be presented as a com- 
pletely “objective” task to the subject. 

In view of the lack in other studies, the 
use of overtly anti subjects by Katheryn 
Vaughan (1961) is particularly significant. 


Vaughan conducted her study near the U. S. 
Mexican border, where persons of Mexican 
origin compose a substantial portion of the 
population, and in another part of the same 
state about 600 miles north, where such per- 
sons are not common. In the vernacular, per- 
sons of Mexican origin or descent are called 
"Latins" (a more polite term than some others) 
and non-Latins are "Anglos." 

Replicating the Sherif-Hovland study (1952, 
1953), Vaughan compared the placements of 
statements about Latins, with 11 categories 
prescribed in instructions and with the Own 
Categories Procedure. She constructed a pool 
of 103 statements, selecting 60 which were com- 
posed of five sub-sets: two sub-sets consistently 
placed at the extremes, two with moderate 
variability but scale values on either a favor- 
able or unfavorable side, and one sub-set with 
extremely high variability. 

The results for the 11 instructed categories 
represent a striking mirror image of those found 
for Negro and pro-Negro whites in the Hov- 
land-Sherif study (1952). Intermediate cate- 
gories were neglected by intensely anti-Latins, 
their extreme category for pro-Latin statements 
was very broad, and a smaller mode in the 
distribution of judgments occurred at the ex- 
treme anti-Latin end, with which they agreed. 
Residents of the area with sparse Latin popu- 
lation, and little concern with it, distributed 
their judgments fairly evenly over the 1l 
categories. 

The same trends were greatly accentuated 
using the Own Categories Procedure. Over 85 
per cent of the anti-Latins used three or fewer 
categories, while almost 92 per cent of the un- 
involved subjects used four or more categories. 
The range was 2-5 categories for involved anti- 
Latins and 2-11 for involved subjects, but only 
8 per cent of the latter used three categories 
or fewer. 

Vaughan at no time requested any statement 
as to which categories were acceptable or ob- 
jectionable to the subjects; but she reports that 
many of the highly anti-Latin persons had 
great difficulty in following instructions to cate- 
gorize the statements objectively as favorable- 
unfavorable to Latins. They kept insisting that 
many of the statements were "not true," and 
they had all sorts of "facts" to prove it. This 
observation represents the considerable evi- 
dence that the individual's Own Categories 
for acceptance-rejection do serve as the refer- 
ence scale for his placements, even when he 
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is warned not to use them, provided that the 
object of judgment is of importance to him. 


CONCLUDING REMARKS 


The Own Categories Procedure does not 
suffer from certain limitations of most available 
techniques for attitude measurement. Its logic 
does not derive from statistical or physical 
models for measurement, but from widely gen- 
eral principles of psychological functioning. 

The task required of the individual does not 
presume a common culture or educational level, 
but is applicable with changed content to any 
cultural setting where objects are sorted. 

It may be used in the study of attitudes 
without arousing the individual's awareness 
that his attitude is being explored for research 
purposes. 

It yields quantitative measures both for the 
location of his acceptable and most objection- 
able categories and his degree of involvement 
in the matter. A restricted number of categories 
with the greatest mode .in the judgment dis- 
tribution at the objectionable segment and a 
secondary mode at the acceptable segment is 
typical of highly involved persons. To the ex- 
tent that the individual is little involved in 
the matter at hand, his distribution of the 
items approaches more equal divisions into a 
greater number of categories. In this event, 
the acceptable and objectionable categories 
may be ascertained after he has sorted them, 
from the labels he uses for the various cate- 
gories. 

When appropriate for the research purposes, 
the individual may be asked directly to evalu- 


ate the objects in terms of his acceptances- 
rejections. By varying the stimulus material, it 
is possible to assess individuals’ categories rela- 
tive to those prevailing in their own and dif- 
ferent cultural groups, and to predict their 
reactions to cultural items deviating from their 
latitudes of acceptance. In fact, we believe that 
findings on the individual's own categories will 
have much greater predictive value than most 
conventional tests yielding a single position to 
represent the individual's attitude. 

For example, in our study of reactions to 
communications on the 1960 Presidential Elec- 
tion we found that highly committed partisans 
of the major parties judged a communication 
mildly favoring their opponent as much more 
extreme than it was—contrasting it to their 
own position by displacing it away from their 
own position. Less involved persons adopting 
the same position as most acceptable assimi- 
lated the identical communication toward their 
own position. This finding is predictable when 
one knows that the less involved person has a 
more finely differentiated set of categories and 
that he is noncommittal toward those in the 
segment of positions presented by the speaker. 
The highly involved person, on the other hand, 
has very broad categories, including one or 
more for objectionable positions, and this 
latitude of rejection covers moderate positions. 

In our opinion, the use of similar theory 
and procedures for the study of reactions to 
sociocultural change and innovation is one of 
the most exciting among several problems to 
be explored through the Own Categories 
Procedure. 
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23. Some Studies of Social Distance 


HARRY C. TRIANDIS and LEIGH M. TRIANDIS 


INTRODUCTION 


In all societies there are norms specifying 
the “correct” or “appropriate” behaviors for 
members when they encounter other people. 
For instance, some societies have what is known 
as a “mother-in-law taboo,” a norm which 
states that “when you are in the presence of 
your mother-in-law, you hide or avoid getting 
close to her.” Here the “person” is identified 
by the kin relationship, and the behavior is 
one of keeping physical distance. 

In our society, certain kinds of people have 
norms of avoidance of persons who are differ- 
ent from themselves with respect to such char- 
acteristics as physical type (race), belief system 
(religion or political philosophy), occupation, 
nationality, and so forth. Avoidance may in- 
volve exclusion of persons with such “unde- 
sirable” characteristics from the “friendship 
circle,” the neighborhood, the occupational 
group, the place of work, and sometimes from 
the country. The degree to which individuals 
are willing to accept people who differ from 
themselves into their own social group may be 
considered a measure of their “social distance” 
from these out-group persons. Research shows 
that a number of factors influence social dis- 
tance judgments. The most important of these 
factors will be considered in this article. They 
include both characteristics of the persons be- 
ing judged—their race, religion, nationality, 
occupation, and sex; and characteristics of the 
judges—their culture, social class, educational 
level, and personality traits. 


THE MEASUREMENT OF SOCIAL DISTANCE 


It is possible to find out what kinds of 
people are avoided, and how much they are 


* Reprinted from I. D. Steiner and M. Fishbein 
(Eds,), Current Studies in Social Psychology, Holt, 
Rinehart and Winston, New York, 1965, with per- 
mission of the senior author and publisher. 


avoided, by asking a person to indicate 
whether he would accept a particular kind of 
person (for example, a Negro) as an intimate 
friend, as a neighbor, as a fellow employee, 
and so on, or whether he would exclude him 
from such relationships. The sociologist Bo- 
gardus established a scale of social distance by 
employing the following items, which are listed 
below in the order of increasing social distance: 
(1) would admit to close kin by marriage; 
(2) would admit to my club as personal chum; 
(3) would admit to my street as neighbor; 
(4) would admit to employment in my occu- 
pation; (5) would admit to citizenship in my 
country; (6) would admit as a visitor only to 
my country; (7) would exclude from my coun- 
try. Bogardus (1928) found that American white 
subjects maintained little social distance toward 
Englishmen, Canadians, and Northern Euro- 
peans, and more social distance toward South- 
ern Europeans such as Italians and Greeks. 
The greatest amount of social distance was 
found for people who differed from the sub- 
jects with respect to racial characteristics (such 
as, Orientals and Negroes). 

However, these results are ambiguous. When 
an American white subject indicates much so- 
cial distance toward Negroes, it is difficult to 
know whether he rejects them because of their 
physical type or their probable lower-class back- 
ground. Or, to take another example, when an 
American shows social distance toward Irish- 
men, does he object to their nationality or 
to their probable religion (Roman Catholic)? 
Triandis and Triandis (1960) avoided this am- 
biguity by asking their white American subjects 
to react to hypothetical persons who were de- 
scribed in greater detail than is customary in 
studies of social distance. Thus subjects were 
asked to indicate their acceptance of a Negro 
who is a physician, Portuguese, and a Roman 
Catholic. By a statistical technique called anal- 
ysis of variance it was possible to find out 
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whether the race, the occupation, the nation- 
ality, or the religion of the stimulus persons 
determined the social distance. 

Let us clarify this procedure. The first step 
requires a standardization of the social distance 
scale. A large number of statements, such as 
those used by Bogardus, are presented to a 
group of judges. The judges are asked to indi- 

- cate, on an ll-point scale, how much social 
distance is implied by agreement with each of 
the statements. Typically the statement, “I 
would marry this person,” is given as an an- 
chor, and defines the zero social distance point; 
the statement, “I would kill this person,” is 
used as the anchor for the other end of the 
scale. When that is done, a judge may decide 
that agreement with the statement, "would 
exclude from the neighborhood," implies con- 
siderable social distance—say, six points on an 
eleven-point scale. A group of judges typically 
provides a distribution of such judgments for 
each statement. The distribution of judgments 


for each statement is analyzed by a technique 
developed by the psychologist Thurstone, and 
a single scale value is derived for each state- 
ment. 

Table 1 shows some social distance state- 
ments and some values obtained from such a 
standardization process. Suppose that an Ameri- 
can subject is asked whether he would or would 
not do the various things listed in Table 1 
with a "Negro Physician." Let us say that a 
subject indicates that he would not accept the 
Negro Physician as an intimate friend, as a 
close kin by marriage, as a roommate, or as 
a personal chum, but would accept him as a 
neighbor, as a family friend, in the same apart- 
ment, and so on; and would not do any of the 
negative things that are listed. Then the social 
distance of this subject toward "Negro Physi- 
cians" is 38.7. 

Let us assume that a subject obtains the 
social distance scores shown in Table 2 when 
he evaluates four types of persons: Negro phy- 


TABLE 1. Scale Values of Statements Used in Two Cultures 


American Greek 
Scale Scale 
Statement Value Value 

I would marry this person 0.00 0.00 
I would accept this person as an intimate friend 1.1 13.5 
I would accept this person as a close kin by marriage 21.5 28.5 
I would accept this person as a roommate 29.5 — 
I would accept this person as a member of my intimate 

social group (in Greek, parea) — 31.1 
I would accept this person as a personal chum in 

my club 31.1 — 
I would accept this person as my family’s friend 40.9 24.0 
I would accept this person as a neighbor 38.7 — 
I am going to invite this person to dinner — 83.3 
I would live in the same apartment house with 

this person 49.4 — 
I would rent a room from this person 57.5 42.8 
I would be willing to participate in the lynching of 

this person (in Greece: I would kill this person if 

I had the chance) 100.0 100.0 
TABLE 2 
DnA 

Negro White Total 

Physician 38.7 Ih 49.8 
Unskilled laborer 68.7 21.5 90.2 } os 

Total 107.4 32.6 


D eee Gam md 
Difference — 74.8 


sicians, Negro laborers, white physicians, and 
white laborers. " 

It is intuitively clear that this subject pays 
more attention to the "race" of the stimulus 
than to the "occupation," because the differ- 
ence between the "Negro" and "White" stim- 
uli is about 75 points, while the difference 
between the “Physican” and “Unskilled La- 
borer” stimuli is only about 40. 

When the analysis of variance technique is 
used, it is possible to learn several things: 


l. Whether the subject “pays attention" to 
a particular characteristic. For instance, a sub- 
ject might obtain the following scores: 


Negro White 
Physician 11.1 11.1 
Unskilled laborer 68.7 68.7 


This pattern of scores would indicate that 
"race" is of no relevance whatever in the deter- 
mination of his social distance scores. In the 
analysis of variance, the characteristic "race" 
would be found to control no "significant 
amount of variance." It is desirable to compute 
the statistical significance of the variance of 
the social distance scores that is controlled 
by each of the characteristics of the stimulus 
persons so that one may disregard apparent 
effects that are probably accidental or unde- 
pendable. The level of significance tells us 
whether the obtained difference in social dis- 
tance scores, when shifting from the “preferred” 
to the "nonpreferred" value of the character- 
istic, is likely to have been observed by chance, 
or not likely to have been observed by chance. 
In the latter case, we may infer that the subject 
has been "paying attention" to the character- 
istic, 

2. It is possible to learn the relative impor- 
tance of the characteristics for the particular 
subject. The statistical procedure indicates how 
much variability in social distance scores is 
controlled by each of the characteristics, and 
also how much total variability there is in the 
social distance scores obtained by a particular 
subject. The percentage of the total variability 
which is controlled by a specific characteristic 
serves as an index of the weight of that char- 
acteristic for the subject. For example, if the 
total variability were 100, race might control 
60 percent, occupation $0 percent, and "error" 
10 percent of the total variability. We would 
say that the weights for race and occupation 
are 60 and 30. This means that race controls 
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twice as much variability (or makes twice as 
much difference) as does occupation in the 
determination of the subject's social distance 
scores.! 

3. Finally, we can learn whether there are 
so-called "interactions" between the character- 
istics. In the first example shown above, there 
is an interaction between race and occupation. 
"The subject shows some social distance toward 
Negroes and some toward unskilled laborers; 
but he shows a particularly large amount of 
social distance toward the stimulus that has 
both the "nonpreferred" characteristics. "The 
statistical technique also permits checks of the 
statistical significance of this interaction. 


Thus, the use of this procedure permits us 
to learn at least three things: (1) whether the 
subject really pays attention to a particular 
stimulus characteristic, (2) whether he pays 
attention to combinations of characteristics, 
and (3) how much importance he attaches to 
each of the characteristics and their combina- 
tions. 


THE RELATIVE IMPORTANCE OF THE 
CHARACTERISTICS OF THE STIMULUS 
PERSONS 


The first study that used this technique 
showed that American white subjects paid some 
attention to all four of the characteristics used 
in the study and attached the following 
weights to them: race 77, occupation 17, reli- 
gion 5, and nationality 1. (Note: these weights 
add to 100.) When the responses of the subjects 
were averaged out before the analysis, there 
was no evidence of any interactions but for 
individual subjects there were many inter- 
actions. In other words, one subject might 
have shown particularly large social distance 


1 To be completely correct, this procedure requires 
the subtraction of the error variance from the 
variance due to a particular source. The "correc- 
tion" formula typically leads to ratios or weights 
that differ by less than 10 percent from the ratios 
obtained without the correction. In the example 
in the text this correction would result in weights 
of 45.5 and 182, which are still in the ratio of 
about 2 to 1. In all the work reported in this ar- 
ticle, this correction has not been used because the 
“weights” obtained by the more elaborate proce- 
dure are extremely similar to the uncorrected 
weights. As long as one is interested in the ques- 
tion, "what is the order of importance of the 
weights," it is unnecessary to make the correction, 
since the corrected and uncorrected weights give 
the same answer. 
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toward “unskilled Negroes,” and another to- 
ward “unskilled Jews.” 

Weidemann and Triandis (in preparation) 
did a study in which the characteristics being 
considered were the sex, race, religion and sta- 
tus of the stimulus persons. Since many of the 
social distance items that are appropriate for 
other groups (such as, admit to my street as 
neighbor, admit to close kin by marriage, ex- 
clude from my country) are not very appro- 
priate for measuring attitudes toward women, 
the authors used a series of questions which 
involved placing the stimulus persons in posi- 
tions of varying status. In order to control for 
the context in which the status differential 
occurred, five triads of superior, equal and 
subordinate status positions were used. One 
such triad, for example, was as follows: 


Superior Status: a director of a play in 
which you are acting 

Equal Status: another actor or actress with 
you 

Subordinate Status: a stage hand in a play 
in which you are acting. 


'The subjects of this study were white and 
predominantly Christian students at the Uni- 
versity of Illinois. The weights obtained for 
the male subjects were: race (white-Negro), 46; 
religion (Christian-Jewish), 25; sex  (male- 
female), 28; status (superior-equal-subordinate), 
l. The weights for the women were: race 60, 
religion 26, sex 13, and status 1. Thus the 
women placed more emphasis upon race and 
less on sex than did the men. Whites were 
preferred to Negroes, Christians to Jews, and 
men to women by both male and female sub- 
jects. Status was unimportant in determining 
social distance, when considered for all stimu- 
lus persons. However, interaction effects 
showed that both Negroes and women were 
rejected in positons of superior status by both 
male and female subjects. Although the over- 
all social distance toward women is consider- 
ably less than that expressed toward Negroes, 
women were rejected more strongly than were 
Negroes in superior positions; that is, subjects 
preferred a Negro man to a white or Negro 
woman for superior status positions. ‘This pref- 
erence occurred for both male and female 
subjects but was much stronger for women 
(race x status = 26, sex x status = 74) than for 
men (race X status — 42, sex x status — 58). 


SUBJECT CHARACTERISTICS: FIRST STUDY 


Some subjects showed much more social dis- 
tance than other subjects. Thus, social distance 
appears to be determined not only by societal 
norms, but also by the demographic and per- 
sonality characteristic of the subjects. 


Demographic Characteristics 


Triandis and Triandis (1960) found that up- 
per class subjects tend to show less social dis- 
tance than lower class subjects; that Jewish 
subjects tend to show less social distance than 
Christian subjects; and that subjects with a 
Southern or Eastern European national back- 
ground tend to show less social distance than 
subjects with a Northern or Western European 
background. These differences probably reflect, 
in part, the norms of these various societal 
groups and, in part, the historic factors that 
are related to immigration to this country. 


Personality Characteristics 


Some subjects, regardless of ethnic or na- 
tional background, show more distance than 
other subjects. In the study described above, 
subjects who scored high on the California 
Fascism Scale tended to show more social dis- 
tance than did subjects who scored low on 
that scale. The California F-scale is an instru- 
ment that reflects (inversely) the social sophis- 
tication or breadth of perspective of the 
subject (Kelman and Barclay, 1963). 

A good deal of previous research employing 
the F-scale, and other scales designed to mea- 
sure different aspects of the subject's person- 
ality, suggested that a person may show more 
social distance if he is insecure about his own 
merit. A secure person is capable of facing 
people different from himself, who may chal- 
lenge his values and assumptions much more 
comfortably than is an insecure person. People 
hold many kinds of beliefs about which there 
can be no objective proof. The only way for 
them to feel sure that their beliefs are correct 
is to talk with other people who hold the 
same views. Thus, people who hold different 
beliefs pose a real threat to an insecure indi- 
vidual and he is likely to avoid them. If an 
individual has a narrow, unsophisticated pet 
spective, he is likely to have a very simple sys 
tem of beliefs, and “strange” people can 
be particularly threatening to this system 
(Rokeach, 1960). 


————————— nnn ee - ui ta — - = 


To summarize, then, a person’s social dis- 
tance from other people is in part determined 
by the norms of his social goup and in part 
by his own personality. Norms are learned from 
talking to his parents, friends, and valued asso- 
ciates; personality is determined by both ge- 
netic predisposing factors and the childhood 
training experiences to which he was subjected 
by his parents. His general education and level 
of sophistication are additional factors that 
may broaden his perspective and make other 
people, who are different from himself, less 
threatening. 


SUBJECT CHARACTERISTICS: SECOND 
STUDY 


From these considerations, Triandis and 
"Triandis (1962) derived two hypotheses: (1) dif- 
ferent cultures have different norms about so- 
cial distance, and (2) within a culture, the more 
insecure and anxious an individual is, the 
higher the amount of social distance he will 
feel towards people who are not like him- 
self. 

To test these hypotheses, Triandis and 
Triandis administered a questionnaire to 100 
university students at the University of Athens, 
Greece, and to 100 students at the University 
of Illinois. The questionnaire presented com- 
plex stimuli varying in all possible combina- 
tions of the following characteristics: Race: 
White-Negro; Occupation: Bank Manager-Coal 
Miner; Religion: Same-Different from that of 
the subject; and Nationality: French-Portu- 
guese. The “same-different” religion was ex- 
plained to the subjects by asking them to think 
of their own religion when seeing the words 
“same religion”; and of that religion, out of 
a set that included different Christian and Jew- 
ish denominations, which they considered most 
different from their own when seeing the words 
“different religion.” In other words, an Ameri- 
can Jewish subject who saw the stimulus: 
“Portuguese, Negro, Bank Manager, Different 
Religion" may have responded to "Portuguese, 
Negro, Bank Manager, Roman Catholic"; and 
a Greek Orthodox subject, looking at the same 
stimulus may have responded to a “Portuguese, 
Negro, Bank Manager, Conservative Jew.” 

The first step in the study involved separate 
standardizations of the social distance scales in 
the two cultures. The scale statements were 
standardized by the Thurstone successive inter- 
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vals technique, described above, using a popu- 
lation of 100 Greek high school students. 

Table 1 presents some of the statements used 
in the study and shows the scale values of the 
statements that were obtained from the Ameri- 
can and Greek judges. 

The 16 stimuli generated by the character- 
istics mentioned above (race, religion, occupa- 
tion, and nationality) and the social distance 
statements were presented to a new sample of 
100 Greek and 100 Illinois students. The pat- 
terns of responses obtained from the two cul- 
tures had numerous similarities. In both 
cultures the preferred stimulus was the 
“French, Bank Manager, White, Same Reli- 
gion.” In both cultures the “Portuguese Miner, 
Negro, Different Religion" stimulus was the 
least preferred. The average distance toward 
the latter stimulus was 37.1 and 33.4 for the 
American and the Greek subjects respectively. 
The average American subject would accept a 
“Portuguese Miner, Negro, Different Religion" 
person as a neighbor and the average Greek 
would invite him to dinner. But, it must be 
remembered that these average social distance 
scores are greatly depressed by the large num- 
ber of both Illinois and Greek students who 
would accept all of the stimuli under all cir- 
cumstances. Actually, although none of the sub- 
jects indicated that they would be willing to 
kill any of the stimulus persons, there were 
some who would exclude Negro or Jewish stim- 
uli from their country and many who would 
exclude them from their neighborhoods. 

The similarities between the American and 
Greek scores suggest that there is about the 
same variability in the social distance scores in 
the two cultures. "The next question is this: 
Are the social distance scores determined by 
the same characteristics in the two cultures? 
The answer is no, particularly if we pay atten- 
tion to the relative weights of the character- 
istics. 


The Culture of the Subjects 


The American subjects in this study gave a 
large weight to race; the Greeks gave a large 
weight to religion. The American weights were 
as follows: race 86; occupation 3; religion 8; 
and nationality 3. The Greek weights were: 
race 24; occupation 5; religion 56; and nation- 
ality 0. Thus, the Americans gave large weights 
to race as they had done in the previous study 
(Triandis and Triandis, 1960), and small 
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weights to religion (anti-Semitism) and occu- 
pation, while the Greeks gave religion the big- 
gest weight and race and occupation smaller 
weights. 

This finding supports the hypothesis that dif- 
ferent cultures have different norms about so- 
cial distance. The average American subject 
apparently focuses more on race, and the aver- 
age Greek subject more on religion. This hy- 
pothesis is further strengthened by Triandis, 
Davis and Takezawa (in preparation). These 
researchers repeated the Trandis and Triandis 
(1962) study with German high school students 
and Japanese university students. For the Ger- 
man study they used stimuli formed from dif- 
ferent combinations of race (Negro-white), 
occupation (physician-unskilled laborer), reli- 
gion (Protestant-Catholic-Jewish), and nation- 
ality (German-American-Italian). The Japanese 
stimuli were also formed with the characteris- 
tics of race (Negroid-Caucasoid-Mongoloid), 
occupation (physician-unskilled laborer), reli- 
gion (Protestant-Catholic), and nationality (Jap- 
anese-American-Portuguese). The same stimuli 
were also presented to two samples of Illinois 
students. 

First, we might ask, what kind of results were 
obtained from the American students who re- 
sponded to the "German study stimuli" and 
those who responded to the "Japanese study 
stimuli.” Since the stimuli were a little dif- 
ferent from those used in the previous studies, 
we should expect some differences. The weights 
obtained with the German stimuli and white 
American Protestant subjects were as follows: 
race 60; occupation 22; religion 6; and nation- 
ality 1. For the Japanese stimuli the weights 
were: race 57; occupation 35; religion 1; and 
nationality 4. Table 3 summarizes these find- 
ings. The stars indicate the level of significance 
of the results. When there are four stars it 
means that there is one chance in 10,000 that 
the subjects in the particular culture did not 
pay attention to the corresponding charac- 
teristic. 

The results of Table 3 are fairly consistent 
with the previous results. The major difference 
occurs on the weight given to occupation. This 
is probably due to the fact that the difference 
between Bank Manager and Coal Miner em- 
ployed in the Greek study is smaller than that 
between Physician and Unskilled Laborer em- 
ployed in the German and Japanese studies. 
The other difference is on religion and nation- 


ality between the Greek and the Japanese 
studies. But here again the stimuli account for 
the difference. The Japanese stimuli did not 
include the characteristic "Jewish," hence the 
drop in the size of the weights from 8 to 1, 
and they did include the nationality “Japa- 
nese," while in the Greek study the nationali- 
ties were French and Portuguese. These results 
suggest that the weights depend on the extent 
to which one uses extreme characteristics to 
describe a particular dimension. Thus, weights 
must be considered in conjunction with the 
kinds of stimuli used in a particular study. 
However, when samples from two different cul- 
tures are given the same stimuli, differences in 
the weights will reflect differences in emphases 
that are due to the cultures of the subjects. 
The German subjects provided the following 
weights: race 6; occupation 70; religion 12; 
and nationality 2. There is a spectacular dif- 
ference between the American and German 
subjects in the weights given to race and occu- 
pation, with the American subjects giving 
more weight to race and less weight to occu- 
pation than do the Germans. The Germans 
also give more weight to religion and nation- 
ality than do the Americans. In the case of the 
Japanese, the weights were: race 38; occupation 
50; religion 0; and nationality 3. Again the 
Americans give more weight to race and less 
weight to occupation than do the Japanese. 
To sum up, there is little doubt that differ- 
ent cultures employ different weights in the 
determination of the social distance. The typi- 
cal American weights are much larger for the 
characteristic race than for other characteris- 
tics. The typical Greek weights are larger for 
the characteristic religion. The typical German 
weights are very high for occupational status, 
moderately high for religion, and relatively 
low for race. The typical Japanese weights are 
very high for occupation and race, and small 
for nationality. It must not be assumed, how- 
ever, that these results characterize every sub- 
ject in each of these cultures. There is a great 
deal of variability within cultures that is attrib- 
utable to the religion and social class of the 
subjects. For example, Triandis, Davis and 
Takezawa found that the weights given by dif- 
ferent subsamples of American subjects were 
quite different. The American Jewish male 
subjects showed little social distance, and the 
amount they did show was mostly determined 
by the occupation of the stimulus persons. 
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The American Jewish female subjects showed 
a similar pattern, though they did give sub- 
stantially larger weights to “religion” than did 
the male subjects. 


The Personality of the Subjects 


In both their American and their Greek 
samples, Triandis and Triandis (1962) found 
that subjects who obtained high social distance 
scores tended to answer “yes” to a wide variety 
of attitude questions, including items from the 
California F-scale and statements that were in- 
tended to be the obverse of those items. Per- 
sons who obtained high social distance scores 
also showed a tendency to check the most ex- 
treme positions on graphic rating scales, thus 
indicating either very strong agreement or very 
strong disagreement with attitude statements. 
Psychologists are not agreed concerning the 
meaning of “acquiescent” and "extreme" styles 
of response to attitude questionnaires, but such 
reactions are commonly believed to reflect per- 
sonality factors. Through the use of analysis 
of variance, Triandis and Triandis were able 
to show that the cultural backgrounds (Ameri- 
can or Greek) of their subjects controlled 
almost twice as much variance in social distance 
scores as did the two response styles. 

In a later study (Triandis, Davis and Take- 
zawa), several personality measures were ob- 
tained and correlated with social distance scores. 
American subjects who scored high on social 
distance tended to be low on independence of 
judgment (measured by a scale developed by 
Barron) and on tolerance for ambiguity (mea- 
sured by a scale developed by Budner). German 
subjects who were high on social distance also 
showed low tolerance for ambiguity (measured 
by a scale developed by Brengelmann). Thus, 
there is a cross-cultural replication of this result, 
with two different kinds of scales measuring 
tolerance for ambiguity. This characteristic is 
related to the ability of the subject to "suspend 
judgment" until he has considerable informa- 
tion. Subjects who have low tolerance for am- 
biguity tend to "prejudge" other people on the 
basis of obvious group characteristics such as 
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their race or religion. From the Weidemann 
and Triandis (in preparation) study, it is worth 
noting that "prejudice toward women," as mea- 
sured by the Nadler-Morrow (1959) Scale, cor- 
relates .30 (p «.01) with the authoritarian 
content of the F-Scale, corrected for response 
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The attitudes of the high social distance sub- 
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very likely to agree strongly with attitude state- 
ments such as "What youth needs most is strict 
discipline, rugged determination and the will 
to work and fight for family and country," or 
"In my opinion patriotism and loyalty are the 
first requirements of a good citizen." "These 
statements are similar in content to some of the 
propaganda messages disseminated by Fascists 
and Nazis before the Second World War. It is 
interesting that the postwar young German 
subjects (born in 1944) tested in the Triandis, 
Davis and Takezawa study rejected such state- 
ments more strenuously than did the Illinois 
subjects tested in the same study. The attitudes 
of the latter subjects toward a large variety of 
social and political issues were measured by 
Triandis et al. and related to the degree of 
social distance shown by the subjects. It was 
found that high social distance subjects indi- 
cated much more conformity to existing social 
institutions and prevailing points of view about 
political behavior, and were more conservative 
and uncritical of these values, than did the low 
social distance subjects. 


SUMMARY 


For normal populations, such as those tested 
in the studies described above, social distance is* 
greatly influenced by cultural norms concerning 
what is appropriate behavior towards persons 
who are “different.” Individuals who are par- 
ticularly sensitive to these norms, and who tend 
to conform and accept uncritically the values 
imparted to them by their culture, are particu- 
larly likely to show large amounts of social 
distance. 
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24. Exploratory Factor Analyses of the Behavioral 
Component of Social Attitudes 


HARRY C. TRIANDIS 


In a recent statement of a theory of attitude 
structure and change, Katz and Stotland (1959) 
examined some of the directions in which re- 
search on attitudes might proceed. They pointed 
out that attempts to find consistent relationships 
between attitudes and behavior might have 
failed because of the use of attitudes with weak 
or nonexistent behavioral components. They 
suggest that 


the assessment of attitudes should include more 
than the measurement of affectivity and evalu- 
ation. It should include measurement of the 
belief component, the behavioral component, 
and the linkage of attitude and its value system 
(p. 465). 


Important contributions to the measurement 
of the evaluative component have been re- 
viewed by Edwards (1957). Osgood, Suci, and 
Tannenbaum (1957) provided a simple proce- 
dure for the measurement of the "meaning of 
an object,” which includes assessment of its 
evaluation. Zajonc (1954) has related some of 
the cognitive and affective elements associated 
with an attitude. Fishbein (1961) has examined 
the relationship between beliefs and attitudes, 
and has provided semantic-differential proce- 
dures for the measurement of the cognitive and 
affective components of attitudes. Peak (1955), 
Rosenberg (1956), and others have examined 
the perceived instrumentality of attitude ob- 
jects for the attaining of goals and values. 

The behavioral component of attitudes, how- 
ever, has generally been neglected. Bogardus’ 


* Reprinted from Journal of Abnormal and Social 
Psychology, 1964, 68, 420-430, with permission of 
author and the American Psychological Association. 
This research was supported by a grant from the 
University of Illinois Research Board. 


(1928) scale of a social distance involves this 
component. Some improvements in the psycho- 
metric characteristics of this scale have been 
made by Triandis and Triandis (1960, 1962), 
but their work also suggested that the dimen- 
sionality of the social distance scale should be 
examined. Although for very rough work the 
social distance scale may be regarded as unidi- 
mensional, more refined work requires examina- 
tion of the dimensions of social behavior. 

The present paper conceives of attitudes as 
consisting of several components, one of which 
is the behavioral component. It focuses on the 


question: "What are the main dimensions of- 


the behavioral component of attitudes?” 

The isolation of dimensions underlying any 
theoretical construct, such as the construct “the 
behavioral component of attitudes,” has several 
advantages: it permits clearer analysis and 
hence better understanding of the theoretical 
construct; it permits explicit measurement of 
the dimensions defining the construct, through 
a few items which are relatively “pure” or inde- 
pendent of each other; and it suggests that 
when behaviors that are closely related to a 
particular dimension are to be predicted from 
attitude measures, it is desirable to give more 
weight to the relevant dimension, and less 
weight to the other dimensions of the behav- 
ioral component of attitudes. 

This paper presents a methodology for the 
analysis of the behavioral component of atti- 
tudes, and some examples of the kinds of data 
and results that are obtainable with this meth- 
odology. 

The variables used in this exploration create 
a cube of data. One face of this cube consists 
of stimulus persons that may vary in race, Te 
ligion, social class, age, sex, and various per- 
sonality characteristics. The second face of the 


m SPI 
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cube consists of behaviors that a person might 
undertake in relation to such stimulus persons 
—for example, have a cocktail with them, ex- 
clude them from the neighborhood, elect them 
to political office, etc. On the third face of the 
cube are the characteristics of the subjects in 
the investigation—their demographic, person- 
ality, and other characteristics. A fourth dimen- 
sion of exploration might be the situation or 
behavior setting (Barker and Wright, 1954) in 
which the interaction takes place. In the pres- 
ent exploration, the situation was not specified. 

The procedure was an extension of that em- 
ployed by Triandis and Triandis (1960, 1962) 
and by Triandis (1961). A typical judgment 
made by a subject is exemplified by the follow- 
ing item: 


A 50-year-old, Negro, Roman Catholic, phy- 
sician, male 


would :—;—:—:—:—:—:—:—:—: would not 
have a cocktail with this person 


The data can be analyzed in several ways. 
One may ask: “How are the behaviors corre- 
lated with each other?” or “How are the stim- 
ulus persons related to each other?” or “What 
kind of subjects give one kind of result and 
what kind give another?” A variety of research 
projects might be undertaken to explore this 
cube of data. 


METHOD 


Selection of Stimulus Persons 


Previous work on social distance (Triandis, 
1961; Triandis and Triandis, 1960, 1962) has 
used race, social class, religion, nationality, and 
philosophy of life as the characteristics of the 
stimulus persons. Studies of employability have 
used stimuli differing in age, sex, race, religion, 
social class, competence, sociability, and phys- 
ical and social disability (Rickard, Triandis, 
and Patterson, 1963; Triandis, 1963). Factorial 
designs were employed, usually with two levels 
of each characteristic (for example, white versus 
Negro, highly competent versus barely compe- 
tent). From these studies it was determined that 
race, sex, age, occupation, and religion are very 
important characteristics of stimulus persons 
and ought to be included in the first explora- 
tion of the cube of data mentioned above. 


The following levels of the five characteristics 
were used: Race—Negro-white; Sex—male-fe- 
male; Age—20-50; Occupation—physician-soda- 
fountain clerk; and Religion—Protestant-Roman 
Catholic-Jewish. Thus, the stimuli formed a 
2x2x2x2x53 factorial design. This design 
required 48 stimulus persons. However, since 
unrealistic stimuli might antagonize the sub- 
jects, the 20-year-old physicians and the Jewish- 
Negroes were dropped from the design. In 
addition to the stimuli mentioned above, four 
"white, male, Protestant, 50-year-old" stimuli 
that might be important in a subject's "person 
life space" were included: A person doing the 
same work as the subject, the subject's super- 
visor, a person who works for the subject, and 
a person who represents a labor union in the 
subject's place of work. A total of 34 complex 
stimuli were used. 


Selection of Behaviors 


The behaviors were selected through a con- 
tent analysis of novels. A random sample of 80 
American novels written after 1850 was taken 
from the University of Illinois library, and a 
random sample of 10 pages was selected from 
each novel. The 800 pages were carefully read 
and every social behavior was recorded. A social 
behavior was defined as a situation in which a 
person does something in relation to, or with, 
another person. A list of approximately 700 be- 
haviors was obtained. If two behaviors were 
extremely similar (for example, "have a drink 
with" and "have a cocktail with"), one of them 
was dropped from the list. Behaviors that ap- 
peared only once in the sample of 800 pages 
and appeared to have no social significance 
were also dropped (for example, "Lift on my 
shoulders," "Push button in elevator for"). Be- 
haviors that were judged as being too ambigu- 
ous (for example, “Astound”’) were also dropped. 
This procedure reduced the 700 behaviors to 
165. 

Since a further reduction in behaviors was nec- 
essary, a pretest questionnaire was administered 
to various psychology classes. The questionnaire 
required judgments of how six stimulus persons 
might be involved with the subject in the 165 
behaviors. The six stimulus persons were speci- 
fied only as to occupation, which ranged in 
prestige from scientist to street cleaner. Thus, 
6 x 165 or 990 judgments had to be made on 
9-point scales, The typical subject completed 
only about two-thirds of these judgments. Since 
the order of presentation was counterbalanced, 
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approximately an equal number of subjects 
judged each item. 

Three sets of instructions were tried in this 
pretest. The instructions used by Osgood et al. 
(1957) asked the subjects to indicate the extent 
of the relationship between the person (concept) 
and the behavior (scale). Another set of instruc- 
tions asked the subjects to judge whether “it is 
likely” that he would behave in the way specified 
by the scale with the stimulus person. The third 
set of instructions used the word "appropriate" 
instead of the word "likely." For each set of in- 
structions, the medians of the distributions of 
the judgments of the subjects, judging a partic- 
ular person on a particular scale, were recorded. 
Inspection of these medians indicated no impor- 
tant differences between the three instructions. 
However, since the semantic-differential instruc- 
tions tended to result in more discrimination 
between the high- and low-prestige stimulus 
persons they were adopted. 

Further examination of the medians revealed 
that some behaviors provided very little discrim- 
ination between the six stimulus persons. The 
criterion of a difference between the medians 
of at least 2 scale units, for any pair of stimulus 
persons, was adopted for retention of the scale 
in the sample. 

Finally, the distribution of the judgments of 
the subjects, when judging a particular stimulus 
person on a particular scale, was used as an in- 
dication of ambiguity. Those behaviors which 
Provided distributions with an interquartile 
Tange of more than 5 scale units were elimi- 
nated, since ambiguity was suspected. The 
above two procedures reduced the number of 
behaviors to 105. 

The 105 remaining behaviors were studied 
according to Guttman's (1959) facet-analysis pro- 
cedures. The behaviors could be classified into 
two facets: One facet dealt with the kind of 
relationship that was implied by the behavior— 
subordinate, coordinate, or superordinate; the 
other facet with the behavior area—social, cog- 
nitive, emotional, physical, or moral. For in- 
stance, the behavior "Believe" was classified as 
cognitive subordination; “Make love to" as emo- 
tional coordination; "Bully" as physical super- 
ordination, etc. In order to reduce the sample 
and at the same time make it maximally heter- 
ogeneous, only one behavior per cell was re- 
tained. For instance, “Believe” and “Learn 
from : were both classified in the cognitive sub- 
ordination cell, and only one of them was re- 
tained in the sample. In deciding which one to 


retain the experimenter used his own judgment, 
but an attempt was made to retain as many of 
the Bogardus scale items as possible. By these 
steps a final sample of 61 behaviors was achieved. 


Subjects 

Since the present research was purely explor- 
atory, an attempt was made to sample student 
subjects that were as heterogeneous as possible. 
A class of 35 beginning introductory-psychology 
students took the complete questionnaire (61 
behaviors x 34 stimulus persons) which re- 
quired about 3 hours. A sample of 85 “sophisti- 
cated” subjects, who had almost completed the 
experimenter’s course in social psychology 
(where social norms, prejudice, etc., had been 
discussed) were also given the questionnaire. 
Each of the subjects in this group completed a 
random third of the questionnaire in one class 
period. The responses of the subjects in each 
sample to a particular questionnaire item 
(stimulus person-behavior combination) were 
summed, and divided by the number of subjects 
who had responded to the item. These average 
judgments formed a 61 x 34 matrix (behaviors 
X stimulus persons). The behaviors were then 
correlated with each other across the 34 stimulus 
persons. "Thus, a matrix of 61 x 61 correlations 
was obtained from each sample. In spite of the 
differences between the two samples, the corre- 
lation between the 1,830 correlation coefficients 
obtained from the first sample with the 1,830 
corresponding coefficients obtained from the 
second sample was .92. Thus, neither psycho: 
logical training nor the partial completion of 
the questionnaire by different subjects seems to 
have affected the correlations between the be- 
haviors. For this reason, the data obtained from 
the two samples were merged. 

The merged data were then divided according 
to the subject's sex and religion. The criterion 
of at least 10 subjects in a particular subsample 
was adopted. Analyses for the Roman Catholic 
and Jewish women are not available, because 
the number of such subjects was insufficient. 

Since the research that is to be presented was 
conceived of as purely exploratory, no attempt 
was made to obtain random, or even adequate, 
samples of subjects. The main goal was to dis 
cover whether this kind of approach is likely 
to be fruitful. Much further work is required 
before generalizations applicable to the United 
States population, or some fraction of this pop- 
ulation, can be made. In the discussion which 
follows, the terms “Protestant females," “Jewish 
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males," etc, are merely being used to describe 
particular samples. No inferences concerning all 
Protestant females or all Jewish males should 
be made from these data. It is hoped that pub- 
lication of this report will suggest similar studies 
to investigators having access to populations 
that are not available to the present investi- 
gator. If a number of similar studies, employing 
the present design, are undertaken, it may be 
possible to extrapolate to- some fraction of the 
United States population. 

In addition to the above samples, a class of 
37 introductory-psychology students was given 
the California F Scale and the 10 highest and 
10 lowest students in the distribution were in- 
vited to take the questionnaire. 

Thus, the analyses that are to be reported 
were based on complete data (2,640 judgments) 
of 10 High F Scale subjects (authoritarians); 
10 Low F Scale subjects (nonauthoritarians); 
22 Protestant, 16 Roman Catholic, and 10 Jew- 
ish men; and 17 Protestant women—a total 
of 85 subjects. All subjects were white. 


Treatment of the Data 


The judgments of the subjects in a particular 
sample, for a particular stimulus person-behav- 
ior combination, were summed and divided by 
the appropriate N. These average judgments 
formed a matrix of 61 rows (behaviors) x 34 
columns (stimulus persons). The rows were cor- 
related, using Ns of 34. In this way, a matrix 
of 61 x 61 correlations between behaviors was 
obtained. Since a factor analysis of the behav- 
iors could not be undertaken unless only 
34 — 1 — 38 behaviors were considered, the 61 
behaviors were divided into two sets of 28 with 
an overlap of 5 marker variables. The division 
was done so that variables highly correlated 
with each other would belong to different sets. 
The two sets of 38 behaviors were factor ana- 
lyzed independently. Furthermore, one set of 
33 behaviors was factored for each of the 
sub-samples—authoritarians, nonauthoritarians, 
Protestants, Roman Catholic, and Jewish males, 
and Protestant females. 

The highest value in each column was used 
as an estimate of the communality. To expedite 
the analysis, both the ILLIAC and 1401 IBM 
computers were used simultaneously, with some 
of the samples being processed by one and some 
by the other computer. Principal-axes analyses 
were performed on ILLIAC and centroid factor 
analyses on the 1401 (for which there was no 
principal-axes program). For the data obtained 


from the Protestant males, both centroid- and 
principal-axes analyses were performed. In all 
analyses seven factors were extracted. Five of 
these turned out to be meaningful and were 
rotated to simple structure by the Kaiser (1958) 
varimax procedure. In the Results section the 
method of factor analysis is indicated. 

Once the factors were rotated, factor scores 
of the stimulus persons on each of the five fac- 
tors were obtained by means of the matrix 
equation F — (4^ 4)-1 A’ Z, where F is the 
matrix of the factor-score estimates, 4 the 
matrix of factor loading, A’ the transposed A 
matrix, and Z the matrix of the scores of the 
stimulus persons on the behaviors in standard 
score form (see Harman, 1960, pp. 360-361, for 
complete discussion). Thus, each of the 34 stim- 
ulus persons acquired five scores, one score on 
each of the five factors. The entries of the 
missing cells corresponding to unrealistic stim- 
uli (for example, the Negro, Jews, and 20-year- 
old physicians), were estimated from the entries 
in the adjacent cells. The characteristics of the 
stimulus persons were viewed as the "treat- 
ments" in an analysis of variance design, and 
the factor scores obtained by each of the stimu- 
lus persons on a particular factor were con- 
sidered as the dependent variable. Thus, for 
each of the subsamples of subjects, it was pos- 
sible to do five analyses of variance—one for 
each of the five factors—and determine the 
amount of variance that each of the character- 
istics of the stimulus persons determined on 
each of the five factors. The advantage of this 
last step in analysis is that it indicates what 
characteristics of the artificial stimuli, or what 
interactions of these characteristics, are relevant 
in the determination of the variance of scores 
on each factor. 


RESULTS 


The analyses of the undifferentiated samples 
of subjects proved unclear. When the sex and 
religion of the subjects are not considered, 
confusion of scales such as "Marry" makes the 
resulting factors not comparable to the factors 
obtained from differentiated samples. For this 
reason, the discussion below will focus on the 
results obtained from the differentiated samples. 


Factors Obtained from the 
Protestant Male Sample 

Table 1 shows the behaviors that loaded on 
the factors obtained from the Protestant male 
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TABLE 1. Factors Obtained from Protestant Male Sample 


From Second Set of 33 Variables 


Varimax 
Rotation 
Loadings 


From First Set of 3$ Variables 


Varimax Rotation 
Loadings 


Social Behavior Principle- Social Behavior 
Axes Centroid Centroid 
Factor Factor Factor 
Analysis | Analysis Analysis 
Factor I: Formal Social Acceptance with Subordination versus Rejection with Superordination 
Admire character of Obey .88 
Cooperate in political cam- Ask for opinion of .88 
paign with* Believe 85 
Plead with Disregard opinion of —.84 
Elect to political office Give dinner party in the 
Drop name from list of my honor of .80 
dinner guests Cooperate in political 
Exclude from first class campaign with* 76 
hotel Not vote for —74 
Praise suggestions of Accept help from 69 
Invite to my club Not change places with —.66 
Admire ideas of Invite to a large party 65 
Accept this person calling Work for 57 
me by my first name Accept as an intimate 
when I do not call him friend* 53 
by his (hers) 
Be commanded by 
Accept as an intimate 
friend* 
Percentage of total 
variance accounted 
for 26 
— | 


Fall in love with Marry* 


.96 


Marry* Physically love 
"Teach Go out on a date with 95 
Smile at Kiss the hand of 93 
Lose game when this person Love even after his (her) 

is my competitor death 91 


Percentage of total 
variance accounted 
for 


Factor III: Friendship Acceptance versus Rejection 


Eat with 


Win game in which this 


person is my competitor Go fishing with 

Be partners with in an Drink with .69 
athletic game Accept as an intimate 

Gossip with friend* 68 

Accept as an intimate Accept as chum 67 
friend* 


Be on a first name basis 
with 
Percentage of total 
variance accounted 
for 


Factor IV: Hostile Acceptance versus Social Distance 


Prohibit from voting Exclude from my neigh- 


Permit to do me a favor borhood* 719 
Admit as a tourist in my Accept as a close kin by 

country marriage —-72 
Accept as an intimate Work with —.63 

friend* Discuss moral issues with —.50 
Prohibit the admission of Accept as an intimate 

this person to my club friend* —24 


Percentage of total variance 
accounted for 


* Marker variables. 
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TABLE 1 (Continued) 


From First Set of 3$ Variables 


Social Behavior Principle- 


Treat as a subordinate* 
Admire ideas of 
Lose game to 
Be commanded by 
Recommend the employ- 
ment of in a firm in 
which I have “an 
influence” 
Percentage of total 
variance accounted 
for 


* Marker variables. 


subjects, for both sets of 33 behaviors. Factor 
I was called “Formal Social Acceptance with 
Subordination versus Formal Social Rejection 
with Superordination.” For both sets of 33 vari- 
ables, the behaviors with the high loadings im- 
plied either formal acceptance (“Cooperate in 
political campaign with,” “Give dinner in 
honor of,” “Vote for,” etc.) or subordination 
("Obey," “Admire character of,” etc). The 
positive pole of this factor suggests behaviors 
that are appropriate towards social superiors 
and the negative-pole behaviors that are ap- 
propriate toward social subordinates. Factor II 
is clearly a marital acceptance factor. Factor 
III suggests same-sex friendship acceptance. 
Factor V is a pure subordination-superordina- 
tion factor, suggesting interaction with super- 
visors and other persons of "rank." 

Factor IV obtained from the first set of 33 
behaviors differs from the corresponding factor 
obtained from the other set of behaviors. The 
highest loadings included the “Prohibit from 
voting” behavior, but there was an insignifi- 
cant loading on “Exclude from my neighbor- 
hood.” Thus, it appears to be a “Hostile 
Acceptance with Voting Prohibition” factor. 
The second set of behaviors, due to a more 
fortunate sampling of variables in that set, 
seems to have yielded a fairly clear “Social 
Distance” factor. 

The meaning of the factors may be clarified 
by the analyses of variance of the factor scores 
of the stimulus persons. Thus, for the sample 
of Protestant males, the analysis of variance 


Varimax Rotation 
Loadings 


From Second Set of 33 Variables 


Varimax 
Rotation 
Loadings 


Centroid 
Factor 
Analysis 


Social Behavior 


Treat as a subordinate* 


Command —55 
Change places with 42 
Work for 4 


Obey 


of factor scores on Factor I, shown in Table 2, 
gave the following results: Formal Social Ac- 
ceptance with Subordination was more likely 
with white than with Negro stimulus persons 
(p <.001) and race controlled 38.4% of the 
variance on that factor; physicians were more 
accepted than soda-fountain clerks (p < .001) 
and occupation controlled 37.8% of the vari- 
ance; persons of the same religion as the 
subject (Protestants) were more acceptable 
(p <.001) and religion controlled 4.4% of the 
variance; persons who were 20 years old were 
more acceptable (p < .001); same religion phy- 
sicians were particularly acceptable (p < .001); 
and Roman Catholic women were much less 
acceptable than Roman Catholic men—the 
Sex x Religion interaction was significant (p < 
.01) and controlled 1.1% of the variance.! 


Factors Obtained from Other Samples 


Since the analyses of the other samples are 
based on relatively few cases, the data will not 
be presented.? The data suggest that there are 


1 Factor II, Marital Acceptance, showed the prefer- 
ence of the subjects for marital partners of the 
same age, race, and religion, and of the opposite 
sex. Factor III showed a preference for friends of 
the same age, sex, religion, and race. Factor IV 
showed prohibition from voting for 20-year-old 
non-Protestants, and Factor V, subordination to 
physicians. 

2 The results of the other samples—Tables A-H— 
have been deposited with the American Documen- 
tation Institute, Order Document No. 7807 from 
ADI Auxiliary Publications Project, Photoduplica- 
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interesting differences between the subsamples. 
Thus, though similar factors appear across the 
various samples, some differences in emphasis 
are observable, as indicated by different load- 
ing patterns, In addition, the difference be- 
tween the High and Low F Scale subjects was 
in the expected direction. The High F Scale 
subjects produced only three clear factors, while 
the Low F Scale subjects produced the usual 
five clear factors. The simpler structure ob- 
tained from the authoritarian subjects is con- 
sistent with the categorical thinking of the 
subjects, as discussed in the literature. 

A suggestion of the nature of these factor 
differences, across samples, is provided by Table 
3. 


Congruence of Factors across Samples 


In order to obtain an objective measure of 
the similarity of the obtained factors across 
the various samples, coefficients of congruence 
(Tucker, 1951) were computed between all 
possible combinations of factors. Table 3 sum- 
marizes some of the congruity coefficients, as 
well as the other results. 

A relatively rigorous criterion for acceptance 
of the “existence” of a factor is a coefficient of 
congruence of .50 or more with some factor 
obtained from another sample. This criterion 
may be improper, since it is conceivable that 
a factor which is unique to one religious group 
may exist. However, the Formal Social Accep- 
tance with Subordination, Social Acceptance 
with Condescension, and Marital Acceptance 
factors clearly meet this criterion. At a some- 
what less rigorous criterion level, the Friend- 
ship Acceptance and Subordination factors also 
seem clear. The relative lack of clarity of the 
Social Distance factor suggests that there was 
conflict and nonagreement about social dis- 
tance, As a result, it was unclear what behav- 
iors, beyond nonacceptance in the neighborhood 
and as a kin by marriage, were considered 
appropriate towards persons differing in race. 


DISCUSSION OF OBTAINED FACTORS 


'The present study was not designed to ob- 
tain factors from samples of subjects differing 
in religon and sex. This separation was neces- 
sary to obtain maximum clarity, and much of 
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the discussion deals with the data obtained 
from these samples. However, since the samples 
of subjects were so unsystematic, nonrandom, 
and small, no general statements can be made 
about the religion and sex of subjects as deter- 
minants of the obtained factors. 

The study was undertaken as an exploration 
of the way subjects cluster social behaviors. 
It would have been perfectly feasible to factor 
the correlation matrix of the behaviors of every 
single subject and to discuss his or her way of 
clustering behaviors. Since this would have re- 
quired 85 factor analyses (1 for each of the 
subjects) and presentation of the data would 
have been very difficult, it was decided to 
average the responses of homogeneous subjects. 
This method successfully provided a first ap- 
proximation of some of the factors used to 
structure the social relations space. 

Thus, it is likely that typical subjects, in our 
culture, use some combination of the factors 
presented in Table 3. Some of the factors that 
show considerable congruence across samples 
are probably sufficiently common to be used 
by most people. For example, the Formal So- 
cial Acceptance with Subordination factor 
seems to be widely used. The General Social 
Acceptance factor, typical of the authoritarian 
subjects only, shows poor congruence (only .65) 
with it, but is a very similar factor. The Social 
Acceptance with Condescension factor, found 
only with the Jewish males, shows relatively 
poor congruence (.62) with the Formal Social 
Acceptance with Subordination factor, but is 
also essentially the same. Thus, this factor may 
be said to be present in all the samples. The 
Marital Acceptance and Friendship Acceptance 
factors, although showing less congruence, are 
also essentially present in most of the sample. 
The Social Distance and Subordination-Super- 
ordination factors are less ubiquitous, but are 
present in some form or other in most samples. 

The remaining factors are less general. How- 
ever, most of the "typical scales" of Table 3 
should be included in future work, in order 
to explore their generality. 

The Formal Social Acceptance with Subor- 
dination factor appears to include behaviors 
that are appropriate in relation to social su- 
periors or inferiors with whom the subject has 
no institutionalized relationship. By contrast, 
the Subordination-Superordination factor ap- 
pears to group behaviors appropriate for inter- 
action with supervisors or subordinates. The 
Marital Acceptance factor summarizes inter- 
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actions appropriate with the opposite sex. Race, 
age, and religion determined significant amounts 
of variance on this factor. The Friendship 
Acceptance factor also appears meaningful. 
Protestant males preferred same sex, age, reli- 
gion, and race persons as friends. The Social 
Distance factor seems to be reflecting mostly 
the rejection of Negroes. This is consistent 
with previous findings (Triandis and Triandis, 
1960, 1962). 

The validity of this method of analysis can 
be intuitively evaluated by examining the load- 
ings of various behaviors on a factor on which 
there is known sex-role differentiation. For 
example, a number of behaviors had different 
loadings on the Protestant male and female 
Marital Acceptance factors. The male factor 
had a loading of .63 on the behavior “Teach,” 
while the female factor had a loading of — .10. 
This is consistent with sex-role differentiation 
in our culture. 


GENERAL DISCUSSION 


The present study has accomplished three 
things: It demonstrated the general method- 
ology for the construction of Behavior Differ- 
entials, and reduced the 61 social behaviors to 
a more limited number of 20 to 25 behaviors 
that might be used in future work; it demon- 
strated the method of analysis of the cube of 
data formed by the stimulus persons, the social 
behaviors, and the subjects rating the stimuli 
on the behaviors; and it showed that the ex- 
ploration of the subjects’ structuring of the 
social relations space, by means of the Behavior 
Differential, is feasible and leads to “sensible” 
results. 

The present research also has implications 
for studies of social distance. Previous writers 


3 A related analysis may be mentioned parenthet- 
ically. When the 34 stimulus persons were corre- 
lated over the 61 behaviors and the resulting 
34 x 34 matrix factored, four factors accounted for 
949, of the observed variance. Factor I seemed to 
involve stimulus persons towards whom there is 
much cultural determination of behaviors. The 
low-status Negro women obtained high loadings 
on this factor; the low-status white women and the 
high-status Negroes, low loadings. Factor II was 
a status factor, with the 50-year-old physicians 
obtaining high loadings and the 20-year-old soda- 
fountain clerks, low loadings. Factor III contrasted 
the "datable" stimuli (20-year-old white women) 
with the other stimuli. Factor IV contrasted the 
unusual stimuli (for example, 50-year-old Negro 
woman physician) with the common stimuli. 


on social distance, including the present writer, 
have assumed that social distance is a unidi- 
mensional construct. If this were the case, the 
Marital Acceptance factor would represent the 
least amount of social distance, the Friendship 
Acceptance factor the next least, and the 
Formal Social Acceptance factor the next least; 
the subjects would answer three items repre- 
senting these factors as shown in Table 4. 'The 


TABLE 4. Responses Which Would Occur to 
Three Items if Social Distance Were 
Unidimensional 


Formal 
Mar- Friend- Accep- 
riage  ship*  tancet 


Minimal social 


distance Yes Yes Yes 
Slight social distance No Yes Yes 
Moderate social 

distance No No Yes 
Substantial social 

distance No No No 


* For example, “Go fishing with.” 
t For example, “Vote for.” 


existence of response patterns such as those 
shown in Table 5 suggests that these three items 
are independent. Such patterns clearly make 
sense. The data also suggest that for authori- 
tarian subjects, the correlation between “Marry” 


TABLE 5. Responses Indicating Independence 
of Three Items 


Formal 

Mar- Friend- Accep- 

riage ship* tancet 
Subject, Yes Yes No 
Subject, No Yes No 
Subject, Yes No No 


* For example, “Go fishing with.” 
t For example, “Vote for." 


and "Vote for" was slightly negative (— .16, ns). 
Between "Marry" and "Elect to political office" 
it was — 51 (p < .001). Since it is inconsistent 
with Guttman's (1959) concept of unidimen- 
sionality to be willing to marry someone, but 
not be willing to accept him as a political 
candidate, the domain explored by the present 
study is clearly not unidimensional. It might 
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be argued that Bogardus (1928) was aware of 
the multidimensionality of the domain of so- 
cial behavior statements, and for this reason 
was careful to employ only very few statements 
falling on a single dimension. This is correct. 
However, when more precise work requires a 
scale with more than a few items, the unidi- 
mensionality of the scale breaks down. This 
does not mean that unidimensional social- 
distance scales with few items are impossible 
to construct and are not useful for certain 
kinds of research. On the contrary, such scales 
are adequate as long as the research does not 
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25. A Multiple-Indicator Approach to Attitude Measurement 


STUART W. COOK and CLAIRE SELLTIZ 


At least since LaPiere's report (1934) of the 
discrepancy between the actual reception ac- 
corded him and a Chinese couple and the 
answers to a questionnaire about accepting 
Chinese as guests, investigators have been con- 
cerned with the fact that different procedures 
designed to assess the same attitudes have often 
led to quite different placements of the same 
individuals, and that observed behavior toward 
a social object (person, group, etc) is fre- 
quently not what would have been predicted 
from a given instrument intended to measure 
attitude toward that object. There have been 
several types of reaction to such observed dis- 
crepancies. One has been to assume that there 
is a "true" attitude toward the object, which 
one or both measures have failed to gauge 
correctly. A second has been to assume that 
there are different "classes" of attitudes toward 
à given object—for example, "verbal attitudes" 
and "action attitudes"—which should not nec- 
essarily be expected to correspond. Another 
has been to equate attitude with behavior, us- 
ing "attitude" simply as a descriptive term 
summarizing observed consistencies in behavior. 
Still another reaction has been to think of 
attitude as an underlying disposition which en- 
ters, along with other influences, into the deter- 
mination of a variety of behaviors toward an 
object or class of objects, including statements 
of beliefs and feelings about the object and 
approach-avoidance actions with respect to it. 

We prefer the latter position; first, because 
for us, as for others (e.g, Allport, 1954) the 
observation of regularities in social behavior 
seems to point to the operation of relatively 


* Reprinted from Psychological Bulletin, 1964, 62, 
36-55, with permission of the senior author and the 
American Psychological Association. This paper 
was prepared as part of a program of research on 
the measurement of social attitudes supported by 
grants from the National Science Foundation and 
the Air Force Office of Scientific Research. 
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stable underlying dispositions toward classes of 
objects. Further, we believe that apparent in- 
consistencies in social behavior may often best 
be understood in terms of the operation of 
such stable underlying dispositions in shifting 
relation to other influences on behavior. Fi- 
nally, if validly distinguished, a dispositional 
concept has, by its very nature, a wider range 
of situational relevance—including projecta- 
bility into relatively novel situations—than a 
simple descriptive concept equating attitude 
with behavior in specified situations. 

We assume that two classes of variables, in 
addition to an individual's attitudinal dispo- 
sition toward a given object or class of objects, 
influence his behavior in situations involving 
the object or symbols of the object (including 
the behavior constituting his responses to in- 
struments designed to measure attitude toward 
the object): (a) other characteristics of the indi- 
vidual, including his dispositions toward other 
objects represented in the situation, values he 
holds that are engaged by the situation, his 
motivational state, his expressive style, and so 
on; (b) other characteristics of the situation, 
including its prescriptions as to appropriate 
behavior, the expectations of others in the situ- 
ation with respect to the individual's behavior, 
the possible or probable consequences of vari- 
ous acts on his part, and so on. 

In this view, an attitude cannot be measured 
directly, but must always be inferred from be- 
havior—whether the behavior be language in 
which the individual reports his feelings about 
the attitude-object, performance of a task in- 
volving material related to the object (eg 
recall of statements which take a position with 
respect to the object), or actions toward a rep- 
resentative of the object-class (e.g., avoidance 
of such an individual). Lazarsfeld (1959) takes 
a similar position in his discussion of latent 
structure analysis. He points out that there is 
a probability relation between an indicator 
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and the underlying trait of which it is taken 
as an indication; that is, a given trait does not 
invariably produce a given behavior. He 
stresses that, in consequence, some inconsis- 
tency will always be found between different 
measures of a hypothesized trait, and that the 
task of the investigator is to combine them into 
an "index" or "measurement" which represents 
the best inference that can be made from the 
manifold of empirical operations to the under- 
lying characteristic they are assumed to reflect. 

This orientation leads to emphasis on the 
need for a number of different measurement 
approaches to provide a basis for estimating 
the common underlying disposition, and to the 
expectation that data from these approaches 
will not be perfectly correlated. However, it 
seems to us that it should be possible to in- 
crease the correspondence among the indicators 
by careful analysis of other factors that are 
likely to affect response to a given measuring 
instrument and by efforts to reduce or control 
the influence of those factors. Ideally, the goal 
would be to develop one or more measures 
from which the effects of all probable response 
determinants other than attitude toward the 
relevant object would be removed. This goal, 
however, seems unlikely of achievement; there- 
fore it seems to us important to work with a 
number of different measures, in each of which 
an effort is made to eliminate or control in 
some systematic way some identifiable influence 
on response other than the attitude in question. 
Since different influences will be controlled in 
different measures—and thus, conversely, dif- 
ferent influences in addition to attitude will 
affect responses on the different measures— 
there will remain a lack of full correspondence 
among scores on the different measures. 

Social scientists have long recognized that 
factors other than an individual's attitude to- 
ward an object may influence both his response 
to instruments designed to measure the attitude 
and his behavior toward the object in everyday 
life. Much recent work in the field of both 
personality and attitude measurement has been 
concerned with identifying the effects of such 
“extraneous” variables as the tendency to agree 
(or to disagree) with statements regardless of 
their content (e.g, Bass, 1955; Cronbach, 1946, 
1950) or the wish to give a socially acceptable 
picture of oneself (e.g., Edwards, 1953, 1957; 
Taylor, 1961). Another interest has been in the 
development of indirect methods of attitude 
assessment (for a review of such methods, see 


Campbell, 1950). But attempts to develop in- 
direct measures have, for the most part, been 
sporadic, and there has been little effort to 
examine systematically the relation of different 
indirect measures to each other or their rela- 
tive susceptibility to such influences as agreeing 
response set or social norms. 

Despite the general awareness of measure- 
ment problems, examination of reports of ex- 
perimental research on attitudes shows the 
following picture: First, even investigators who 
hold very sophisticated theoretical positions 
about the nature and functions of attitudes 
and the conditions for attitude change com- 
monly use only a single attitude measure— 
typically quite crude—in testing hypotheses 
derived from those theoretical positions. Second, 
most investigators are aware of the possibility 
that responses to these instruments may be 
influenced by factors other than the attitudes 
they are intended to measure. Third, efforts 
are made to guard against the intrusion of such 
factors or to rule out interpretations based on 
the possibility that they have been operative. 
These safeguards usually take one or more of 
the following forms: sampling (e.g., selection 
of groups of subjects believed to differ in sus- 
ceptibility to the extraneous influences most 
likely to be operative in the measurement 
situation), experimental design (e.g., the intro- 
duction of control groups), internal analysis of 
the data (e.g., considering how the responses 
of subgroups of subjects might be expected to 
differ if one determinant rather than another 
were operative). 

We do not mean to minimize the importance 
of such procedures. In any given study they 
may quite convincingly rule out the possibility 
that responses have been influenced by factors 
other than subjects’ attitudes toward the object 
in question. Nevertheless, it seems to us that 
effort directed toward improving measuring 
instruments might be at least equally useful. 


AN EXAMINATION OF DIFFERENT TYPES OF 
MEASURING INSTRUMENTS IN TERMS OF 
THE KINDS OF EVIDENCE THEY PROVIDE 
AS A BASIS FOR INFERENCES ABOUT ATTI- 
TUDE 


In most current research on attitudes, efforts 
directed specifically toward improving measur- 
ing techniques are limited to such matters as 
assuring anonymity, attempting to separate the 
measurement from the experimental sessions, 
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varying the order of presentation of items or 
the context in which they are embedded. If 
“we are to go beyond such limited steps, a more 
Systematic analysis of the characteristics of mea- 
suring instruments is needed than is yet avail- 
able. This paper is a first step toward such an 
analysis. Our purpose is not to present a de- 
tailed review of the different kinds of instru- 
ments that have been used to measure attitudes; 
this has been well done by others (Campbell, 
1950; Deri, Dinnerstein, Harding, and Pepi- 
tone, 1948; Weschler and Bernberg, 1950). 
Rather, we propose to examine broad classes 
of measurement techniques from the point of 
view of the kinds of evidence they provide and 
thus the nature of the inferences involved in 
estimating attitude. By “the nature of the infer- 
ences involved" we mean the grounds for be- 
lieving that attitude toward the presumed 
object is a determinant of responses to the 
measuring instrument, and the bases for infer- 
ring the nature of the attitude from the charac- 
teristics of the responses (i.e, for considering 
à given response as indicative of a positive or 
a negative disposition toward the object). 
We have found it useful to think in terms 
of five major groupings: (a) measures in which 
the material from which inferences are drawn 
consists of self-reports of beliefs, feelings, be- 
havior, etc., toward an object or class of objects; 
(b) measures in which inferences are drawn 
from observed overt behavior toward the 
object; (c) measures in which inferences are 
drawn from the individual's reactions to or 
interpretations of partially structured. material 
relevant to the object; (d) measures in which 
inferences are drawn from performance on 
objective tasks where functioning may be influ- 
enced by disposition toward the object; and 
(c) measures in which inferences are drawn 
from physiological reactions to the object. Not 
all of the measures discussed have been used 
as attitude tests in the formal sense, but for 
each of them there is reason to believe that 
attitude may be an important determinant of 
response and thus that the technique could 
Serve as a basis for inferences about attitude. 
In assessing the adequacy of an instrument 
as an indicator of attitude, consideration of 
its susceptibility to other influences is as im- 
portant as consideration of the grounds for 
believing that underlying disposition toward 
the object is a determinant of response. In 
examining measuring instruments from the 
point of view of the possible influence of fac- 


tors other than attitude, we shall consider two 
major aspects: (a) the probability that overt 
responses may deviate from “private” responses 
—that is, the ease with which an individual 
can alter his responses to present a certain 
picture of himself; (b) the probability that pri- 
vate responses may be influenced by deter- 
minants other than attitude, in the absence of 
any attempt to distort responses. 

Possibilities of influence of private response 
by factors other than attitude are, of course, 
almost limitless; we shall discuss only those 
that seem most probable with respect to each 
type of instrument. Susceptibility of overt re- 
sponse to distortion—that is, the possibility of 
discrepancy between private and overt response 
—would seem to be a function of three char- 
acteristics of the instrument: the extent to 
which its purpose is apparent, the extent to 
which the implications of specific responses 
are clear, and the extent to which responses 
are subject to conscious control. 

In discussing the susceptibility of measures 
to distortion of responses and techniques de- 
veloped to lessen the probability of distortion, 
we assume that with respect to many attitudes 
the settings in which tests are usually admin- 
istered tend to exert pressures in a constant 
direction. It seems reasonable to suppose that 
most respondents, presented with tests in an 
academic setting or under the auspices of some 
other "respectable" organization, will assume 
that the responses which will place them in 
the most favorable light are those which repre- 
sent them as well adjusted, unprejudiced, ra- 
tional, open minded, and democratic. Moreover, 
since these are ideal norms at least in much 
of the American middle class, the pressures 
specific to the test situation are likely to coin- 
cide with inner pressures toward maintaining 
an image acceptable to the self as well as to 
others. By "controversial social attitudes" we 
mean attitudes with respect to which such 
norms are operative. Some of our discussion, 
and especially some of our examples, concern 
techniques for making it easier for the indi- 
vidual to reveal himself as not well adjusted, 
not unprejudiced, etc., or for making it harder 
for him to portray himself, falsely, as well 
adjusted, unprejudiced, etc. While some as 
sumption as to the probable direction of pres- 
sures operating in the situation is necessary for 
the concrete details of certain techniques, the 
principles involved do not hinge on the spe- 
cific direction of pressures; given testing situ- 
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ations in which there is reason to believe that 
the pressures are predominantly in a different 
direction, the techniques can be modified 
accordingly. And many of the techniques re- 
quire no assumption about the probable direc- 
tion of pressures, being designed to reduce the 
effects of extraneous influences in any direction. 


Measures in Which Inferences Are Drawn from 
Self-Reports of Beliefs, Feelings, Behaviors, etc. 


By far the most frequently used method of 
securing material from which to make infer- 
ences about an attitude is to ask an individual 
to reveal—either in his own words or through 
acceptance or rejection of standardized items 
—his beliefs about the attitudinal object, how 
he feels toward it, how he behaves or would 
behave toward it, how he believes it should 
be treated. 

'The basis for inference is clear: it is axiom- 
atic in all definitions that an individual's atti- 
tude toward an object is indicated by his 
beliefs, feelings, and action orientation toward 
it. The nature of the inference is also clear: 
it is assumed that the relationship between 
attitude and expression is a direct one and 
that the attitude corresponds to the manifest, 
common-sense implications of the stated belief 
or feeling. For example, a stated belief that 
the object has characteristics usually considered 
desirable is taken as reflecting a favorable dis- 
position toward it, and a stated belief that it 
has characteristics usually considered undesira- 
ble is taken as reflecting an unfavorable dispo- 
sition. Similarly, a report that the person avoids 
contact with the object is taken as indicating 
an unfavorable disposition toward it, while a 
report that he does or would willingly enter 
into contact with it is taken as indicating a 
favorable disposition. 

In some definitions, attitude is considered 
identical with, or simply a summary of, beliefs, 
feelings, behavior, etc., toward the object; thus 
no problem of inference arises. However, in 
such definitions some criteria must be adopted 
for choosing which behavior constitutes. the 
population of "attitudinal responses" to be 
sampled. The choice of such criteria would, 
we believe, depend upon an analysis essentially 
similar to our consideration of "extraneous 
influences" in the remainder of this paper. 

Self-report measures have a number of char- 
acteristics that make them susceptible to dis- 
tortion of overt responses. The purpose of the 
instrument is obvious to the respondent; the 


implications of his answers are apparent to 
him; he can consciously control his responses. 
Thus a person who wishes to give a certain 
picture of himself—whether in order to im- 
press the tester favorably, to preserve his own 
self-image, or for some other reason—can rather 
easily do so. This difficulty has long been rec- 
ognized, and in recent years it has been exten- 
sively investigated under the rubric of “social 
desirability.” A number of techniques have 
been devised to make the purpose of the instru- 
ment or the implications of the responses less 
apparent; to make it easier to give answers 
that may be considered undesirable; and to 
make it harder to give, falsely, answers that 
may be considered desirable. Some of these 
techniques are focused primarily on reducing 
the likelihood that responses will be distorted 
in an attempt to meet the investigator's expec- 
tations or to please him; others are addressed 
to reducing the influence on responses of a 
desire to maintain a certain self-image as well 
as that of a desire to please or impress the 
investigator. 

One of the simplest approaches to making 
the purpose of the instrument less apparent 
is the inclusion of items not relevant to the 
attitudinal object in which the investigator is 
interested. A variation of this approach is to 
include in each of the items a number of 
aspects in addition to that in which the investi- 
gator is interested; for example, if the inves- 
tigator is interested in attitudes toward one or 
more racial groups, each item may refer to a 
hypothetical person characterized not only in 
terms of race but of age, sex, religion, occu- 
pation, etc. Approaches of either sort serve only 
to make the purpose of the test less obvious. 
They do not competely conceal or disguise it, 
nor can they do so within the format of self- 
report measures, which by definition call for 
the individual's own account of his reactions 
to the attitudinal object. 

Among the simplest, and most frequently 
used, approaches to making it easier to give 
answers that may be considered undesirable 
are assurances of anonymity, statements to the 
effect that "there are no right or wrong 
answers" or that "people differ in their views 
on these things," emphasis on the importance 
of honest answers in order to contribute to 
scientific knowledge or some other presumably 
desirable outcome, efforts to build up rapport 
between questioner and respondent and to 
create the impression that the questioner will 
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not disapprove of whatever views may be 
expressed, 

Other approaches are built into the instru- 
ment itself: including items to which an un- 
favorable reply is likely to be considered 
acceptable (e.g., "Would you be willing to have 
a ditch digger as U. S. Congressman from your 
district?”—Westie, 1952, 1953) in order to 
break down a possible set to give uniformly 
favorable replies; including in the statement 
of a view that may be considered undesirable 
a qualification or a justification of it (e.g. "It 
is best that Jews should have their own frater- 
nities and sororities, since they have their own 
particular interests and activities which they 
can best engage in together, just as Christians 
get along best in all-Christian fraternities"— 
Adorno, Frenkel-Brunswik, Levenson, and San- 
ford, 1950); wording questions in such a way 
that they assume the respondent holds certain 
views or has engaged in certain kinds of be- 
havior (eg. “When did you first . . . ?"— 
Kinsey, Pomeroy, and Martin, 1948). 

Other approaches are designed to make it 
difficult to give, falsely, what may be considered 
a desirable answer. In the measurement of 
personality, a major effort in this direction 
has been the use of forced-choice tests, where 
the respondent is asked to indicate which of 
two statements, matched in terms of social de- 
sirability but differing in their implications 
with respect to traits or needs, is closer to his 
own views or more descriptive of his own 
behavior. This approach has not been exten- 
sively used in the measurement of attitudes. 

In addition to their susceptibility to con- 
scious distortion in order to give the picture 
the individual wishes to present of himself, 
responses to self-report measures may be influ- 
enced by another set of characteristics pre- 
sumably unrelated to attitude toward the object 
in question—characteristics frequently labeled 
"response set” or “expressive style." It has long 
been noted that some individuals have a con- 
sistent tendency to agree (or to disagree) with 
items presented to them, regardless of their 
content; or to select, with more than chance 
frequency, the alternative which appears in a 
given position; or to give extreme (or mod- 
erate) answers. 

A number of techniques have been devised 
to reduce the effects of such tendencies on 
scores that are to be taken as indicative of 
attitudes. Perhaps the simplest and the most 
common approach to the problem of influence 


by a tendency to agree (or to disagree) is to 
vary the wording of items in such a way that 
for approximately half of them agreement rep- 
resents a favorable response to the attitudinal 
object, and for half an unfavorable response. 
Other approaches to this problem involve set- 
ting up the instrument in such a way that 
responses do not take the form of expressing 
agreement or disagreement with one statement 
at a time. The instrument may consist of pairs 
of statements representing roughly opposed 
points of view on a given issue, both state- 
ments being worded positively or both worded 
negatively; the subject is asked to indicate 
which is nearer his own position, or to indicate 
his position on a scale running between the 
two statements. The following pair of items 
from an unpublished scale of attitudes toward 
freedom of speech, developed by students of 
Donald T. Campbell at Northwestern Univer- 
sity, illustrate this approach: 


A. Fascists and Communists are entitled to 
preach their beliefs in this country. 

B. Only those who are in agreement with this 
country’s philosophy of government are 
entitled to preach their beliefs. 


In other instruments, the problem, at least in 
its obvious form, is avoided by using items that 
call for free response—open-ended questions, 
sentence stubs to be completed with the indi- 
vidual's own responses, etc. 

An approach to correcting for the effects of 
a tendency to give extreme answers, or mod- 
erate answers, consists in providing matched 
pairs of items, one referring to the attitudinal 
object, the other referring to some control ob- 
ject, and scoring in terms of the discrepancy 
between the two responses. For example, if 
respondents are asked only, "Would you be 
willing to have a Negro bookkeeper live in the 
same apartment building you live in?" and are 
provided with a 5-point response scale, it is 
impossible to determine whether respondents 
who answer "very willing" differ from those 
who answer simply "willing" in attitude, in 
response style, or in both. Providing a parallel 
item with respect to a white bookkeeper and 
scoring on the basis of discrepancy between 
an individual's responses to the Negro and the 
white removes the effects of response style from 
the score (Westie, 1953). 

Susceptibility of self-report measures to the 
two kinds of influences discussed so far—desire 
to present a certain picture of oneself, and 
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response sets unrelated to the content of items 
—clearly leads to the possibility of distortion 
of responses in the obvious sense of lack of 
correspondence between the overt responses 
and the individual's private beliefs, feelings, 
policy views, etc. Still other factors, however, 
may influence his private beliefs and feelings 
as well as his overt responses. While private 
beliefs, feelings, and action orientations with 
respect to an object are by definition at least 
partially determined by the individual’s atti- 
tude toward the object, they may be influenced 
by other factors as well—for example, by the 
availability of information, or by other values 
the individual holds. Thus, a person who has 
an essentially devaluing attitude toward Ne- 
groes may nevertheless have learned and state 
as his belief that there is no difference in the 
chemical composition of the blood of Negroes 
and whites; on the other hand, a person whose 
disposition toward Negroes is not devaluing 
may know and state as his belief that the aver- 
age scholastic achievement of Negroes in the 
United States is lower than that of whites. A 
person with a devaluing attitude toward Ne- 
groes may nevertheless believe that they should 
not be deprived of the right to vote, because 
he sees this right as an essential ingredient of 
democracy; a person whose attitude toward 
Negroes is not devaluing may be opposed to 
laws forbidding discrimination in the sale and 
rental of housing because he places great store 
on the right of an owner to do with his prop- 
erty as he sees fit. 

To the extent that such other influences 
affect different items differently, or affect only 
certain items, this problem has been attacked 
by examining responses for consistency, elimi- 
nating items which show low agreement with 
total scores, or eliminating those to which re- 
sponses do not fall on a unidimensional scale. 

A given technique may help to reduce or 
correct for extraneous influence from more 
than one source. For example, scoring in terms 
of discrepancy between responses to items con- 
cerning the attitudinal object and comparable 
items about a control object may provide a 
correction for the effects of other values or 
meanings engaged by the items as well as for 
response sets, Asking the respondent to choose 
which of two statements is closer to his views 
may help to eliminate the influence both of 
response set and of concern with the accepta- 
bility of responses, if the alternatives provided 
are equivalent in both respects. 


Not only may a given technique serve more 
than one. function; a given instrument may 
embody a number of techniques designed to 
reduce the influence of extraneous factors. For 
example, in Westie's (1953) Summated Differ- 
ences Test, the subject is presented with hypo- 
thetical persons of specified race (Negro or 
white) and occupation (eight occupations, rang- 
ing from ditch digger to banker, plus “the 
average man”), and asked to indicate, on 
5-point scales, his willingness to accept each 
of these 18 hypothetical persons in each of 24 
relationships—a total of 432 items. Some of 
the items are such that a negative answer is 
likely to be considered acceptable by most 
people (e.g., unwillingness to vote for a ma- 
chine operator, whether white or Negro, as 
President of the United States is not likely to 
be seen as an expression of "prejudice"), thus 
presumably breaking down a possible tendency 
to give uniformly favorable answers whether 
through an acquiescent response set or through 
a desire to give a picture of oneself as unpreju- 
diced. The large number of items, and the 
format of the questionnaire, make it extremely 
unlikely that the subject can remember or 
check his response to a given item with respect 
to one racial group when he is answering the 
comparable item concerning the other group. 
Scoring on the basis of discrepancy between 
parallel items referring to whites and Negroes 
takes account both of possible response sets 
and of the influence of the specified occupation 
and the specified situation. Thus, this instru- 
ment adds to the basic social distance ques- 
tionnaire a number of techniques designed to 
make the focus of the investigator's interest 
less apparent, to make it easier to give answers 
that might be considered undesirable, to cor- 
rect for possible response sets, and to some 
extent to take account of other values or mean- 
ings that may affect responses. 


Measures in Which Inferences Are Drawn 
from Observation of Overt Behavior 


Many investigators have pointed out the 
desirability of using measures in which overt 
behavior toward members of a class of objects 
would serve as a basis for inferences about atti- 
tude toward the object-class. As with self-report 
measures, the basis for inference is clear; all 
definitions of attitude specify that behavior can 
be taken as an indicator of attitude. And, as 
in the case of self-report measures, the usual 
assumption is that there is a simple correspon- 
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dence between the nature of the behavior and 
the nature of the underlying attitude; for 
example, that friendly behavior toward a 
member of a given class of objects indicates a 
favorable attitude toward the object-class. 

There has been much less extensive develop- 
ment of measures of this sort than of self- 
report measures. Situations capable of eliciting 
behavior toward an attitudinal object are more 
difficult to devise and to standardize, and more 
time consuming and costly to administer, than 
self-report measures. Although some measures 
of this type have been devised, they have not 
been widely enough used to provide much 
evidence as to their specific strengths and weak- 
nesses nor to stimulate efforts to correct for 
shortcomings. However, analysis of their char- 
acteristics can provide estimates as to their 
probable susceptibility to influences other than 
attitude and possibilities of reducing such 
susceptibility. 

Attempts to develop behavioral measures 
have followed three general lines. One consists 
in presenting subjects with standardized situ- 
ations that they are led to believe are unstaged, 
in which they believe that their behavior will 
have consequences, and in which the attitudi- 
nal object is represented in some way other 
than by the actual presence of a member of 
the object-class. For example, subjects may be 
asked to sign a petition on behalf of an in- 
structor about to be discharged for membership 
in the Communist party, to contribute money 
for the improvement of conditions for migra- 
tory workers, to indicate whether they would 
be willing to have a Negro roommate. DeFleur 
and Westie (1958) have attempted to develop 
a measure of this sort which is appropriate for 
use in many different testing situations. In 
their procedure, as part of a larger program of 
research, white subjects viewed a number of 
colored photographic slides showing a young 
Negro man and a young white woman, or a 
young white man and a young Negro woman, 
in a social setting; subjects described the 
pictures and answered specific questions about 
them. At the close of an interview following 
this session the measurement procedure being 
discussed here was introduced. DeFleur and 
Westie describe the procedure as follows: The 
subject was told that another set of such slides 
was needed for further research, was asked if 
he (or she) would be willing to be photo- 
graphed with a Negro of the opposite sex, and 
then was given "a standard photograph release 


agreement," containing a variety of uses to 
which such a photograph would be put, ranging 
from laboratory experiments where it would be 
seen only by professional sociologists, to a na- 
tionwide publicity campaign advocating racial 
integration. The subject was asked to sign his 
name to each use of the photograph which he 
would permit. These investigators report that 
subjects "uniformly perceived the behavioral 
situation posed for them as a highly realistic 
request." 

Such devices differ from self-report measures 
with similar content in that, in the behavioral 
measures, the subject either actually carries out 
the behavior (signs a petition, makes a contri- 
bution, etc.) or is led to believe that his agree- 
ment to do so will lead to real-life consequences 
(being asked to pose for a photograph to be 
put to specified uses, being assigned a Negro 
roommate, etc.). 

Another approach is to present the subject 
with an admittedly staged situation and ask 
him to play a role—perhaps to behave as he 
would in such a situation in real life, perhaps 
to take the part of someone else or to act in 
some specified way. Stanton and Litwak (1955) 
presented actual and potential foster parents 
with situations of interpersonal stress in which 
they were instructed to behave in a given way 
(defined as not manifesting specified undesir- 
able or neurotic kinds of behavior); for exam- 
ple, in one scene the subject was instructed 
that he was to play the role of a married man, 
having dinner with his parents; the investiga- 
tor, playing the role of the man's father, treated 
his son like a child, criticized his wife, and put 
him in the wrong. These investigators found 
that ratings based on a half-hour's role playing 
were better predictors of subjects! behavior as 
foster parents (as rated by case workers who had 
sustained contact with them) than were ratings 
based on 12 hours of intensive interviewing by 
a trained social worker. Stanton, Back, and Lit- 
wak (1956) reported that a role-playing ap- 
proach was successful in discovering the limits 
of positive and negative feelings about public 
housing projects on the part of slum dwellers 
in Puerto Rico. These investigators have stressed 
the importance of designing the scene specifi- 
cally to elicit responses relevant to the par- 
ticular behavior or attitude in which the 
investigator is interested. 

A third behavioral approach, used in the 
study of attitudes toward social groups, has 
been to ask for sociometric choices among in- 


"-———— i 


A MULTIPLE-INDICATOR APPROACH TO ATTITUDE MEASUREMENT 227 


dividuals some of whom are members of the 
object group, preferably under circumstances 
that lead the participants to believe that such 
choices will have consequences in the form of 
subsequent assignment in some situation. Early 
applications of this technique to the study of 
intergroup attitudes were made in studies by 
Moreno (see 1943) and by Criswell (1937, 1939), 
in which patterns of choices by school children 
were analyzed in terms of the development of 
cleavage along racial lines. Subsequently, socio- 
metric techniques have been used in research 
evaluating the effects of certain experiences on 
attitudes (e.g, Mann, 1959a; Mussen, 19504, 
1950b) and of the relations among different 
aspects of attitudes (e.g., Mann, 19595). 

There are differences among these three kinds 
of behavioral measures—situations appearing to 
the subject to be unstaged, role playing, and 
sociometric choice—in characteristics that affect 
the probability that overt responses will corre- 
spond to responses that would be shown if the 
individual were not concerned with presenting 
(to others or to himself) a certain picture of 
himself. Let us consider first the extent to 
which their purpose is apparent to the respon- 
dent. To the extent that the purportedly un- 
staged situations are accepted as genuine, the 
respondent will not see them as designed to get 
information about his attitudes; thus one pos- 
sible source of pressure to give responses that 
are likely to be considered desirable is elimi- 
nated. Nevertheless, the implications of his 
behavior as revealing certain characteristics may 
be apparent to him; even if he accepts a ques- 
tion about his willingness to pose with a Negro 
or to have a Negro roommate as genuine, he 
may be aware that a positive answer will have 
the effect of presenting him as unprejudiced, a 
negative answer as prejudiced. Thus, even in 
the absence of awareness that he is being tested, 
an individual may be motivated to give a re- 
sponse that differs from his spontaneous private 
one, in order to present himself to the ques- 
tioner as unprejudiced or to maintain his own 
image of himself as one who behaves in an un- 
prejudiced way. The sociometric choice method 
would appear to be similar in these respects, 
though it may perhaps be assumed that, in the 
absence of special influences calling attention 
to racial or ethnic group membership, the im- 
plications of the choices are less likely to be 
apparent. In the case of role playing, the extent 
to which the purpose of the situation and the 
implications of responses are clear presumably 


depends on the convincingness with which the 
situation can be presented as a measure of some 
other characteristic, such as acting ability. 

All of these behavioral approaches have char- 
acteristics that may operate to make it easier 
to respond in ways that may be considered un- 
desirable. In many situations it is possible to 
justify a negative response on neutral or ac- 
ceptable grounds: one does not believe in sign- 
ing petitions, or he does not like to have his 
picture taken, or he prefers Persons A and B 
to X and Y because they share his interest in 
music. Or, in the role-playing situation, his be- 
havior is shaped not by his own reactions to- 
ward the attitudinal object but by interest in 
the dramatic requirements of the situation. (To 
the extent that these alternative explanations 
are real possibilities, however, they introduce 
other problems about interpretation of the be- 
havior as an indicator of the attitude in which 
the investigator is interested.) 

Some characteristics of the behavioral ap- 
proaches may reduce the probability that the 
individual will modify his behavior in order to 
present an acceptable picture of himself. When 
responses are expected to have real-life conse- 
quences, the anticipation of such consequences 
may counterbalance the wish to make a good 
impression. In a social distance questionnaire, 
if one wishes to present himself (to the tester, 
or to himself, or both) as unprejudiced, there is 
little effective pressure against saying that one 
would be willing to work with a Negro, or to 
have a Negro roommate; but if the question is 
posed in a context where a positive reply is 
seen as leading to assignment of a Negro as a 
co-worker or a roommate, one must weigh his 
willingness to accept that consequence against 
his wish to appear unprejudiced. In role play- 
ing, the pressure for quick response to unantici- 
pated stimulus situations probably operates to 
lessen conscious control of behavior in order to 
produce a desired impression. Faced with the 
necessity of doing or saying something to keep 
the situation going, the individual may not 
have time to consider the impression he is 
making; to the extent that this is so, this 
approach may be thought of as reducing the 
individual’s conscious selection of his response. 

Thus behavioral measures seem to be less 
susceptible than simple self-report measures to 
distortion of response in the interest of present- 
ing a certain picture of the self. But they are at 
least as susceptible as self-report measures to 
the effects of other extraneous influences. It has 


298 ATTITUDE MEASUREMENT 


sometimes been suggested that the model of 
behavioral measures would be apparently un- 
staged situations in which a member of the 
object-class is present. But it is clear that be- 
havior in everyday life situations (which this 
model seeks to approximate) is not determined 
exclusively by attitude toward the presumed 
attitudinal object. In the case of behavior to- 
ward minority groups, for example, social cus- 
tom is a major determinant; in communities 
with segregated transportation systems, almost 
all white people—regardless of their attitudes 
toward Negroes or toward segregation—sit in 
the white section, whereas in communities with 
unsegregated transportation systems, very few 
white people—regardless of their attitudes—re- 
fuse to sit next to Negroes. Other values may 
override attitudes toward the presumed object; 
an individual who feels physical revulsion at 
the experience of eating with Negroes may 
nevertheless do so because he has come to be- 
lieve that the ideals of democracy, or religious 
principles of brotherhood, or the position of the 
United States in the eyes of the world, require 
that all men be treated as equals. Finally, other 
characteristics of the object individuals may 
predominate over their ethnic identification in 
determining response to them. Thus, LaPiere 
(1934) concluded that the factors which most 
influenced the behavior of hotel and restaurant 
personnel to the Chinese couple with whom he 
was traveling "had nothing to do with race"; 
rather, it was the quality and condition of their 
clothing, the appearance of their baggage, their 
cleanliness and neatness, and above all, their 
self-confident and pleasant manner, that deter- 
mined reactions. Observations such as this sug- 
gest that, to the extent that one is interested in 
tapping generalized dispositions toward a given 
group rather than in predicting behavior in 
specific situations, behavioral measures that call 
for response to a symbolic representation of the 
group may be less subject to influence by extra- 
neous factors than measures that call for re- 
sponse to members of the group who are phys- 
ically present. 

Campbell (1961) has suggested an approach 
to the use of behavior measures which is based 
on the ‘premise that different situations have 
different thresholds for the manifestation of 
hostile, avoidant, or discriminatory behavior. 
He suggests that, in order to secure evidence 
about an individual's attitude, it is necessary 
to place him in a number of situations with 


differing thresholds—ranging, for example, from 
eating with a Negro at a business men's lunch- 
eon club (assumed to be a situation with a low 
threshold for nondiscriminatory behavior—in 
other words, one in which it is easy to behave 
in an unprejudiced way) to renting one's house 
to a Negro (assumed to have a high threshold 
for nondiscriminatory behavior) 'The lowest- 
threshold situation in which an individual ex- 
hibits discriminatory behavior would indicate 
his position on a scale of attitude with respect 
to the group in question. Such a procedure 
would be effective in taking account of pres- 
sures that are constant for all, or most, individ- 
uals; it would not, it seems to us, rule out the 
effects of differences in the strength for different 
individuals of such influences as concern with 
social approval, other values seen as relevant to 
the situation, etc. 


Measures in Which Inferences Are 
Drawn from the Individual's Reaction to or 
Interpretation of Partially Structured Stimuli 


The characteristic common to techniques in 
this category is that, while there may be no 
attempt to disguise the reference to the attitu- 
dinal object, the subject is not asked to state 
his own reactions directly; he is ostensibly de- 
scribing a scene, a character, or the behavior 
of a third person. He may be presented with 
a photograph of a member of the object-class 
(usually a person of a given social group) and 
asked to describe his characteristics; or he may 
be presented with a scene in which members of 
the object-class are present and asked to de- 
scribe it, to tell a story about it, to predict the 
behavior of one of the characters, etc. The stim- 
ulus material may be verbal rather than pic 
torial; for example, the subject may be asked 
to complete sentence stubs referring to a hypo- 
thetical third person. 

The bases for inferences about attitudes are 
those common to all projective tests: assump- 
tions that perception of stimuli that are not 
clearly structured is influenced by the perceiv- 
er's own needs and dispositions; that, asked to 
provide an explanation or interpretation for 
which the stimulus presented gives no clear clue, 
the subject must draw on his own experience or 
his own dispositions or his own definitions of 
what would be probable or appropriate; that, 
asked to attribute behavior to others, especially 
under speed conditions, the most readily accessi- 
ble source of hypotheses is the individual's own 
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response disposition. As in self-report and be- 
havioral tests, the usual assumption is that the 
expressed response corresponds directly to the 
individual’s attitude; for example, that attribu- 
tion of desirable characteristics to a member of 
a given group represents a favorable attitude 
toward that group, that interpretation of a 
scene as one in which there is hostility toward 
a member of a given group represents a hostile 
attitude toward the group, that attribution of 
a positive (or a negative) response to a hypo- 
thetical third person with respect to a given 
object reflects a positive (or a negative) dispo- 
sition toward the object in question. 

A major reason for the development of such 
techniques is the assumption that, by disguising 
the purpose of the instrument and the implica- 
tions of responses, they lessen the probability 
of distortion of responses in the interest of pre- 
senting a certain picture of the self. They are 
presented to the respondent not as measures of 
attitudes but as tests of imagination, verbal 
fluency, ability to judge character, social sensi- 
tivity, or some such characteristic. To the ex- 
tent that the respondent accepts these explana- 
tions, he presumably is unaware not only of the 
purpose of the test but of the implications of 
his responses as revealing his own attitudes. 
Even if the subject does realize that he is ex- 
pressing his own attitude, it is assumed that it 
may be easier to express views that may be con- 
sidered undesirable if one does not explicitly 
acknowledge them as his own. In some instances 
the questions asked are nonevaluative, so that 
the implications of one or another response are 
quite unlikely to be apparent to the respon- 
dent; for example, “What is the [nonexistent] 
colored man in the corner doing?" (Horowitz 
and Horowitz, 1938). 

Questions have been raised, however, about 
the validity of the assumption that responses, 
even though spontaneous and undistorted, re- 
flect the individual's own attitude toward the 
object. While it seems clearly established that 
an individual's response may reflect his own 
disposition, it is not certain that it necessarily 
does so. Given a scene in which the roles of 
Negro and white are ambiguous, an individual 
who describes the Negro as being in a menial 
position may be reflecting his own devaluing 
disposition toward Negroes; on the other hand, 
he may simply be reporting the arrangement 
most commonly observed in our culture. Simi- 
larly, the responses he attributes to a hypothet- 


ical third person may be based either on his 
own response disposition or on his estimate of 
how most people would react in such a situa- 
tion. 1 

Attempts to secure evidence as to whether 
responses to instruments of this type do in fact 
reflect the individual's own attitudes have fol- 
lowed two lines: examination of the correspon- 
dence between estimates of attitude based on 
these measures and estimates based on other 
measures (usually of the self-report type); and 
examination of data secured from instruments 
of this sort in the light of predictions about 
patterns of results. 

Several studies have found significant corre- 
spondence between results of measures of this 
type and scores on selfreport measures. Pro- 
shansky (1943) found high correlations between 
scores based on a standard self-report scale for 
measuring attitude toward organized labor and 
scores based on descriptions of briefly-exposed 
ambiguous pictures of relevant social situations. 
Riddleberger and Motz (1957) found that sub- 
jects who scored high and those who scored low 
on a self-report measure of attitude toward Ne- 
groes differed in their explanations of how the 
people in a pictured interracial group had met. 
Sommer (1954), using a modified form of 
Brown's (1947) adaptation of the Rosenzweig 
Picture-Frustration Test, was able to identify 
with considerable success not only individuals 
who scored high and those who scored low on 
a self-report scale of attitude toward Negroes 
but a subgroup who had been instructed to re- 
spond to the Picture-Frustration Test as if they 
were unprejudiced, even though their self-report 
scores were unfavorable. 

However, in view of the assumption that an 
important characteristic of tests of this type is 
their relative lack of susceptibility, as compared 
with self-report measures, to efforts to present a 
certain picture of the self, correspondence with 
scores based on self-report measures is a dubious 
criterion. Getzels (1951), recognizing this fact, 
approached the problem by predicting condi- 
tions under which speeded completions of 
third-person sentence stubs would differ from 
completions, by the same respondents, of the 
same sentence stubs presented in the first per- 
son. He made two predictions: (a) that first- 
and third-person responses would differ on 
items subject to strong social norms not fully 
internalized by all members of the group and 
would not differ on items not subject to such 
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norms; and (b) that in the case of the former 
items, more socially acceptable answers would 
be given on the first-person form than on the 
third-person form. Both predictions were 
strongly supported. Getzels recognized the pos- 
sibility that responses to the third-person form 
might be based on estimates of how most peo- 
ple would respond rather than on the subjects’ 
own response dispositions. Accordingly, he asked 
the subjects to estimate how most people would 
respond to the items about Negroes, and found 
no difference between the average estimates 
made by those whose third-person responses 
had been favorable and those whose third- 
person responses had been unfavorable. 

A number of techniques involving perception 
—in a more literal sense—of ambiguous or un- 
structured material may be considered in this 
category. For example, a number of psychol- 
ogists have been investigating the possible re- 
lation of attitudes to perception of stimuli 
presented under stereoscopic conditions of bin- 
ocular rivalry. Bagby (1957), presenting pairs 
of cards differing in cultural content (e.g. a 
bullfighter and a baseball player) to subjects 
from Mexico and the United States, found that 
Mexicans tended to see the card with Mexican 
content, North Americans those with content 
familiar in the United States. Pettigrew, Allport, 
and Barnett (1958), presenting to residents of 
South Africa pairs of pictures of individuals 
from different racial groups, found that Afri- 
kaners deviated most consistently from other 
groups in their responses, overusing the “Euro- 
pean" and "African" categories, underusing 
"Colored" or “Indian.” 

A study by Bray (1950) made use of unstruc- 
tured visual material in a different way. Taking 
off from Sherif's (1935) finding that estimates of 
movement in the autokinetic phenomenon are 
markedly influenced by the estimates given by 
others, Bray investigated the effects of estimates 
by confederates who were identified as members 
of minority groups. He had the hypothesis that 
the extent and direction of such effects would 
be influenced by the subject's attitude toward 
the minority group. Here the unstructured per- 
ceptual material did not refer to the attitudinal 
object, but simply provided an opportunity for 
expressing indirectly a response to the attitu- 
dinal object—the physically present, minority- 
group member, 

Again, there are problems about the nature 
of the inferences that can be drawn. Bray, for 
example, did not find the direct relationship he 


had predicted between attitude toward the mi- 
nority group (as measured by self-report scales) 
and responses to the minority-group members’ 
estimates. In the case of binocular rivalry, in 
what way, if at all, does attitude influence per- 
ception? Does one see the picture with the most 
familiar content? Does one see the member of 
the racial group toward which he is most fa- 
vorable, or the one toward which he is most 
hostile, or of which he is most afraid? 
Questions such as these point both to the 
need for further research on the usefulness of 
these techniques as measures of attitude and to 
potentially fruitful lines of investigation of the 
relation between attitudes and response to var- 
ious kinds of materials under various conditions. 


Measures in Which Inferences Are Drawn 
from Performance of “Objective” Tasks 


Approaches in this category present the re- 
spondent with specific tasks to be performed; 
they are presented as tests of information or 
ability, or simply as jobs that need to be done. 
The assumption common to all of them is that 
performance may be influenced by attitude, and 
that a systematic bias in performance reflects 
the influence of attitude. 

For example, the subject may be asked to 
memorize material, some of which is favorable 
to the attitudinal object, some unfavorable, 
perhaps some neutral or irrelevant. The assump- 
tion is that material congenial with the subject’s 
own position will be learned more quickly and 
remembered longer. Some empirical support is 
available for this assumption; for example, in 
a study by Levine and Murphy (1943), using 
material about the Soviet Union, and one by 
Jones and Kohler (1958) using statements about 
segregation. Or the subject is given a test of 
“information,” in which at least some of the 
items referring to the attitudinal object either 
have no correct answers or are so unfamiliar 
that it can be assumed that few if any respon- 
dents will know the correct answers; alternative 
responses believed (by the investigator) to indi- 
cate relatively favorable or relatively unfavor- 
able dispositions toward the object are provided. 
The assumption here is that, when forced to 
make a guess on ostensibly factual questions 
where he has no objective basis for an answer, 
the subject is likely to choose the alternative 
most consistent with his own attitudinal dispo- 
sition. This assumption, too, is supported by 
some empirical evidence; for example, studies 
by Hammond (1948) and Weschler (1950) of 
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attitudes toward labor and toward Russia, and 
by Rankin and Campbell (1955) of attitude to- 
ward Negroes. Or the task may be a test of 
“reasoning,” in which syllogisms or other logical 
forms are presented, and the subject is asked 
to indicate which of a number of conclusions 
can appropriately be drawn. Items referring to 
the attitudinal object are paralleled by similar 
items with neutral or abstract content; scoring 
is on the basis of the number and direction of 
errors on the attitudinally relevant items as 
compared with the control items. The assump- 
tion is that reasoning may be swayed by attitu- 
dinal disposition, and thus that errors on the 
attitudinal relevant items reflect the individual's 
own position, if the parallel neutral items have 
been answered correctly. Watson (1925), Mor- 
gan (1945), and Thistlethwaite (1950), among 
others, have developed instruments of this type. 
Thistlethwaite found a significant difference be- 
tween Northern and Southern college students 
in frequency of errors on items dealing with 
Negroes (as compared with errors on the neu- 
tral items), and no corresponding difference on 
items dealing with Jews, women, or patriotism. 

Other measures place the emphasis on the 
material being judged or on the outcome to be 
achieved rather than on the ability involved in 
achieving it. For example, the subject is asked 
to sort items about the attitudinal object in 
terms of their position on a scale of favorable- 
ness-unfavorableness, ostensibly in order to help 
in the construction of a Thurstone-type scale. 
The assumption here is that the rater's own 
attitude toward the object—especially if it is 
extreme—influences his judgments of the favor- 
ableness of statements about the object. Despite 
the earlier belief that ratings of items for 
Thurstone scales are not affected by the raters’ 
own attitudes, a number of recent studies (e.g. 
Hovland and Sherif, 1952) have found such 
effects. 

It seems reasonable to suppose that most sub- 
jects accept these tasks at face value; presum- 
ably only someone with rather sophisticated 
knowledge of research techniques in the social 
sciences would be aware of their attitudinal im- 
plications. Thus it seems reasonable to suppose 
that they may be relatively impervious to dis 
tortion in the interest of presenting a desired 
picture of the self. 

Again, however, there are questions about 
the nature of the inferences to be drawn. If a 
subject shows marked and consistent bias, it 
seems reasonable to infer that he has an attitude 


toward the object strong enough to affect his 
performance. If he does not show consistent 
bias, however, are we to infer that his attitude 
is not strong, or not consistent? In other words, 
how sensitive are such measures? Is it possible 
that individuals with equivalent attitudes differ 
in the extent to which their performance on 
such tasks is influenced by those attitudes? 

Another problem has to do with the direction 
in which attitude influences the response, and, 
conversely, with the nature of the inference to 
be drawn from a given response. Responses may 
reflect either wishes or fears; a member of the 
Communist party may overestimate the number 
of Communists in the United States, but so may 
a member of the John Birch Society. A person 
who underestimates the number of Negro doc- 
tors in the United States may do so on the 
basis of his feeling that Negroes do not have 
the ability to become doctors, or he may do so 
on the basis of his belief that opportunities for 
Negroes to obtain medical training are limited. 

Judgments of the favorableness or unfavor- 
ableness of statements are subject to a similar 
problem of interpretation. Hovland and Sherif 
(1952), working with items about Negroes, found 
that ratings by Negro subjects and by white 
subjects who actively supported desegregation 
differed from ratings by "average" and by anti- 
Negro white subjects. However, other investiga- 
tors (eg, Manis, 1960; Weiss, 1959), working 
with statements about different attitudinal ob- 
jects, found that subjects with extreme attitudes 
—whether favorable or unfavorable—showed 
similar patterns of ratings, which differed from 
those made by subjects with moderate attitudes. 

As with the preceding category, these prob- 
lems of interpretation point to the need for 
caution in inferring the attitude of a given in- 
dividual from a single test of this sort, but they 
seem to point also to the probable usefulness 
of further empirical investigation of the rela- 
tion of scores based on such measures to those 
based on tests providing other grounds for 
inference. 

Another group of measures presented as ob- 
jective tasks or tests of ability focus on the 
extent to which the attitudinal object figures 
prominently in the subject's organization of his 
environment, that is, its salience for him. The 
kinds of data appropriate for inference about 
the salience of an attitudinal object differ in 
some respects from the kinds appropriate for 
inference about the nature or direction of the 
attitude. Measures of salience have been devel- 
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oped primarily with respect to attitudes toward 
social groups. They are of two types: techniques 
for assessing the tendency to classify individuals 
in terms of group membership, and techniques 
for assessing the tendency to subordinate indi- 
vidual differences to group identification. 

One technique for assessing the tendency to 
classify individuals in terms of group member- 
ship, originated by Horowitz and Horowitz 
(1938), may be presented as a test of concept 
formation. It consists in presenting to a subject 
sets of photographs of individuals differing in 
race, sex, age, and socioeconomic status and 
asking him to select those which "belong to- 
gether." For example, one set may contain pho- 
tographs of three white boys, one white girl, 
and one Negro boy. If the subject replies that 
the white girl does not belong, this is taken to 
mean that for him sex is a more important basis 
for classification than race; if he replies that the 
Negro boy does not belong, the inference is 
that race is a more important category for him 
than sex. 

Another technique for assessing the tendency 
to classify individuals in terms of group mem- 
bership, presented as a test of memory, involves 
the clustering, in recall, of verbal symbols for 
which alternative classificatory principles are 
available, This technique rests on the finding 
from studies of verbal behavior that when 
words drawn from various categories are pre- 
sented in random order, subjects tend to recall 
them in clusters, with several words represent- 
ing a given category being recalled together 
even though they were not next to each other 
in the list presented. In studying the salience 
of race as a basis for classification, a subject 
would be presented, in random order, with 
names of people from several different occupa- 
tional categories—for example, baseball players, 
musicians, political figures, actors, one name in 
each category being that of a Negro. The ex- 
tent to which names of Negroes are grouped 
together in recall would provide the basis for 
inference as to the salience of race as a basis for 
dassifying individuals. 

A measure of the tendency to subordinate 
individual differences to group identification, 
originated by Horowitz and Horowitz (1938), 
consists in showing the subject a number of 
photographs of individuals of different ethnic 
groups and then asking him to identify, from 
à larger number of photographs, those he has 
already seen. The task is presented as one in- 
volving perception and/or memory. Scoring is 


in terms of the proportion of correct responses 
to individuals of a given social group as com- 
pared with the proportion of correct responses 
with respect to individuals of other groups. The 
inference here is that accuracy in identifying 
whether or not pictures of specific individuals 
of a given social group have previously been seen 
is decreased by the tendency to subordinate 
individual differences to group identification. 

Seeleman (1940-1941), using pictures of whites 
and Negroes, found a high correlation between 
scores on this measure and scores on a self- 
report questionnaire designed to measure atti- 
tude toward Negroes, with the less-favorable 
subjects less accurate in identifying whether the 
Negro pictures had previously been exposed. 
The question whether there is, in general, a 
correlation between salience of an attitudinal 
object and favorableness of disposition toward 
it is an interesting problem for empirical inves- 
tigation. 


Measures in Which Inferences Are Drawn 
from Physiological Reactions to the 
Attitudinal Object or Representations of It 


At the opposite extreme from measures rely- 
ing on a subject's verbal report of his beliefs, 
feelings, etc., are those relying on physiological 
responses not subject to conscious control. These 
may be measures of a subject's reaction—for 
example, galvanic skin response (GSR), vascular 
constriction—to the presence of a member of 
the object group or to pictorial representations 
of situations involving members of the object 
group. For example, Rankin and Campbell 
(1955) compared GSRs obtained when the ex- 
perimenter was a Negro with those obtained 
when the experimenter was white; Westie and 
DeFleur (1959) recorded GSR, vascular constric- 
tion of finger, amplitude and duration of heart- 
beat, and duration of heart cycle, while the 
subjects were viewing pictures of whites and 
Negroes in social situations. Hess and Polt 
(1960) have photographed pupillary constriction 
in response to unpleasant stimuli and pupillary 
dilation in response to pleasant stimuli. 

Or the measures may involve responses, such 
as salivation, blinking, vascular constriction, 
that have been conditioned to a verbal stimulus, 
and, by a process of semantic generalization, 
appear in response to words or concepts that 
are similar in meaning to the original stimulus. 
For example, Volkova (1953) has reported a 
series of experiments in Russia in which sub- 
jects were conditioned to salivate in response 
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to the word coop; subsequently, such statements 
as "The Young Pioneer helps his comrade" 
brought maximum salivation, while such state- 
ments as "The Fascists destroyed many cities" 
brought minimum salivation. 

In the case of unconditioned physiological 
responses to the presence or the representation 
of the attitudinal object, the basis for inference 
comes directly from the concept of attitude. 
Just as all definitions of attitude include beliefs, 
feelings, and overt behavior as indicators of 
attitude, so do all definitions, explicitly or im- 
plicitly, include physiological responses. It is 
assumed that the magnitude of the physiological 
reaction is directly and positively related to the 
extent of arousal or the intensity of feeling; 
thus, the greater the physiological response, the 
stronger and/or more extreme the attitude is 
presumed to be. Here again, however, there are 
problems in inferring the nature of the attitude 
being reflected. Most measures of physiological 
reaction give direct indications only of the ex- 
tent of arousal; they do not reveal whether the 
corresponding emotion is pleasurable or unplea- 
surable. In general, in attempts to assess atti- 
tudes toward social groups via measurement of 
physiological responses, it has been assumed 
that the range of affect is not from strongly 
favorable to strongly unfavorable but rather 
from accepting, or neutral, to strongly unfavor- 
able; thus the inference has been drawn that 
the greater the physiological response, the more 
unfavorable the attitude. If Hess’ technique of 
photographing pupillary constriction-dilation 
can be adapted to the study of attitudes, it 
would provide a much firmer basis for infer- 
ences about the direction of attitude, since the 
reaction being measured shows a differential 
response to pleasant and unpleasant stimuli. 

In the case of the conditioned physiological 
responses, the basis for inference is somewhat 
different, stemming from learning theory. A re- 
sponse that has been conditioned to a given 
stimulus tends to generalize to stimuli that are 
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similar, Thus, if a response that has been con- 
ditioned to the concept "good" appears when 
the attitudinal object is presented, the inference 
is that the subject considers the object good— 
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if the response does not appear when the atti- 
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other environmental influences. It is difficult to 
control the experimental situation so completely 
that other factors are ruled out as possible 
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Again, questions such as these point to the 
need for extreme caution in drawing inferences 
about the attitude of a given individual from a 
measure of this type. But, again, they point to 
encouraging possibilities for empirical research 
and to the opportunity to greatly increase our 
understanding of attitudes and their relation 
to various kinds of response, by the use of in- 
struments yielding different types of evidence. 
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D. Problems and Prospects in Attitude Measurement 


26. Judgmental Phenomena and Scales of Attitude 


Measurement: Item Displacement in Thurstone Scales 


CARL I. HOVLAND and MUZAFER SHERIF 


In his influential study The Measurement of 
Attitude, Thurstone states as a major require- 
ment for attitude scales that the scale values of 
statements be independent of the position held 
by the judges who do the initial categorization: 


If the scale is to be regarded as valid, the scale 
values of the statements should not be affected 
by the opinions of the people who help to con- 
struct it. This may turn out to be a severe test 
in practice, but the scaling method must stand 
such a test before it can be accepted as being 
more than a description of the people who con- 
struct the scale. At any rate, to the extent that 
the present method of scale construction is af- 
fected by the opinions of the readers who help 
to sort out the original statements into a scale, 
to that extent the validity or universality of the 
scale may be challenged (19, p. 92). 


The first research study of the Thurstone as- 
sumption was carried out by Hinckley (8) on 


* Reprinted from Journal of Abnormal and Social 
Psychology, 1952, 47, 822-832, with permission of 
the junior author and the American Psychological 
Association. The present study, the first of several 
on the topic of judgmental phenomena and scales 
of attitude measurement, was done as part of a 
coordinated program of research on opinion and 
attitude change being conducted at Yale under a 
grant from the Rockefeller Foundation to whom 
acknowledgement is gratefully made (cf. Hovland, 
C. L, Changes in attitude through communication, 
J. abnorm. soc. Psychol., 1951, 46, 424-437). 
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attitudes toward Negroes. Opinion statements 
were categorized by two groups of white judges, 
one with anti-Negro attitudes and the other 
with pro-Negro attitudes, and by a group of 
Negro judges. Scale values were determined by 
the method of equal-appearing intervals. Hinck- 
ley found that the average scale values for the 
two white groups were highly correlated (r= 
.98) despite their difference in attitude on the 
issue and that the scale values for the anti- 
Negro white judges were also closely correlated 
with those for the Negro judges (r = .93). On 
the basis of these results Hinckley concluded 
that "the scale which we have constructed for 
measuring attitude toward the social position 
of the Negro is not influenced in its measuring 
function by the attitudes of the subjects used 
in the construction" (8, p. 298). Substantially 
similar results were obtained in a later study by 
Pintner and Forlano (14), who used the Atti- 
tude toward Patriotism scales which had also 
been derived by the method of equal-appearing 
intervals. 

That the attitudes and opinions of the judges 
have no effect on the placement of items is in 
sharp conflict with the results of studies in the 
fields of perception and judgment. These stud- 
ies indicate that judgments are greatly influ- 
enced by motivational and attitudinal factors 
operative at the time. Thus in the studies by 
Asch, Block, and Hertzman (2) where subjects 
judged photographs, political figures, profes- 
sions, and slogans along such dimensions as 


JUDGMENTAL PHENOMENA AND SCALES OF ATTITUDE MEASUREMENT 237 


intelligence, honesty, and social usefulness, 
judgments were made “in accordance with an 
underlying attitude of acceptance or rejection.” 
Other studies indicate that category intervals 
are directly affected by attitudinal factors. As 
long ago as 1914 Fernberger (7) showed that 
changes in attitude bring about changes in the 
range of judgments of equality, hence in the 
size of category intervals. A series of studies by 
Volkmann (20), Rogers (16), and McGarvey (13) 
showed that category intervals are caused to ex- 
pand or shrink by the experimental introduction 
of anchoring agents beyond or within the scale 
and that “anchoring can be achieved by appro- 
priate verbal instructions, without the use of 
anchoring stimuli” (21, p. 287). 

Displacement of judgments along a scale 
seems a general phenomenon when the stimuli 
carry strong social and personal value. For 
example, Cartwright found that the S's estima- 
tion of equivalence of statements was affected 
by his own position. “The size of the range of 
equivalence related to a ‘radical’ and to a ‘con- 
servative’ differed, depending upon S's own 
political attitudes” (3, p. 195). “For the radical 
S, the judgments of ‘radical’ extend 7.5 units 
while the judgments of ‘conservative’ extend 10 
units. For the conservative S, on the other hand, 
the judgments of ‘radical’ extend 8.5 units, 
while the judgments of ‘conservative’ extend 
only 5.2 units" (3, p. 189). 

In studying the response of Negro youth to 
the social value placed on light skin color, 
Johnson (10) found a tendency for self-ratings 
to be a shade or more lighter than they appear 
to the tester. This was shown even more defin- 
itively in the study by Marks (12) who matched 
each Negro S’s skin color with an objective hue. 
The tendency was to displace one’s own color- 
ing as much as objective fact allowed in the 
direction of the desired norm of light brown. A 
dark individual, rating himself close to average, 
would displace individuals lighter than himself 
to lighter than average categories. In contrast, 
a light individual would judge the majority of 
the group as dark. While the relative position 
of each $ is kept the same by different raters, 
the absolute position varies from rater to rater. 

It is interesting to note that in Cartwright's 
study the Ss were more discriminating in accept- 
ing items at their own end of the scale, and 
lumped together items at the extreme opposite 
to their own stand, while in the study of Marks 
the displacements of items (skin color) were in 
the direction of the desired skin color. It is 


apparent that the direction of item displace- 
ment will depend upon both the stimulus ma- 
terial and the S's motivation in relation to it. 

More recently Hinckley himself, in collabo- 
ration with Rethlingshafer, added further sub- 
stantiating evidence of the effect of the judge's 
own attitude upon his judgments. Starting with 
the general assumption that “men judging 
heights of men on a scale of short to tall would 
be influenced in their scale values by their own 
heights” (9, p. 257), these investigators found 
that “the judgment of the average height of all 
men is influenced by the height of the man 
making the judgment. The ‘meaning’ of the 
social value terms of ‘short’ and ‘tall’ is in part 
determined by the height of the judge. The 
‘egocentric’ influence is also controlled by the 
objective facts, particularly in judging the ex- 
treme heights” (9, p. 262). 

Thus we have the paradox that some leading 
texts in social psychology say in the chapter on 
perception and judgment that judgments are 
greatly affected by the individual's attitudes and 
motives, while in the chapter on scaling meth- 
ods they state that judgments of the meaning 
of items are unaffected by the position of the 
judges who do the sorting. Analysis of prior 
attitude scale studies suggests a possible expla- 
nation for the discrepancy: the utilization of 
too narrow a range of individuals, with the con- 
sequent elimination of the extreme Ss who 
would show variation and displacement of judg- 
ments most clearly. In the Pintner and For- 
lano study, the question may be raised as to 
whether intense personal involvement was 
achieved on the issue of patriotism during the 
thirties, and the authors report that "No group 
could be called unpatriotic" (14, p. 41). In the 
Hinckley study (8), on the other hand, consid- 
erable personal involvement in the issue would 
be expected, particularly on the part of the 
Negro judges. But the report suggests that here 
again the range may have been unintentionally 
narrowed by a procedure he intended only to 
eliminate "careless" subjects: 


One tendency which revealed itself in the sort- 
ing of the subjects was the bunching of state- 
ments in one or more piles to the apparent 
detriment of the other piles. This phenomenon 
of bunching at the extremes was noticed in the 
case of certain of the white subjects, but was 
especially noticeable in the Negro subjects. 
Since the 114 statements are distributed with 
fair uniformity over the entire scale, marked 
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bunching is a sign of careless sorting. If more 
than a fourth of the statements are assigned 
to any one pile, it will leave less than three- 
fourths to distribute over the remaining ten 
piles. Furthermore, the individual who sorts 
the statements in this fashion often ignores 
some of the piles completely. On the assump- 
tion that this bunching was due to poor dis- 
crimination and carelessness every case having 
30 or more statements in any one pile was auto- 
matically eliminated from consideration, and 
the results were not recorded (8, p. 283).1 


From the studies of judgment cited above it 
would be predicted that this piling up of state- 
ments in certain categories by some of the 
white Ss and, in a more noticeable way, by the 
Negro Ss may be due to the effect of the Ss’ 
strong attitudes on the topic. It may be that 
by eliminating these Ss the investigator elimi- 
nated, at the same time, those who had the 
most accentuated attitudes on the issue. It is 
interesting, in connection with this hypothesis, 
that even after all Ss placing 30 or more state- 
ments in any category were eliminated, there 
was still "a slight tendency for Group I, which 
is prejudiced against the Negro, to judge a 
given statement to be more favorable to the 
Negro than Group II [favorable to the Negro} 
judged it" (8, p. 293). 


PROBLEM 


The previous studies of the Thurstone scale 
do not appear to the writers to have fully set- 
tled the issue of whether or not scale values 
are affected by the attitude of the judges, since 
there does not appear to be sufficient evidence 
that adequate representation was made of $s 
at the extremes, with intense involvement in 
the issues being judged. We were, therefore, 
interested in reinvestigating the problem un- 
der conditions which would provide the maxi- 
mum opportunity for the operation of the 
types of factors which have been found impor- 
tant in the field of perception and judgment. 
It is hoped that such a study will be a contri- 
bution to a rapprochement between the areas 
of judgment and of attitude measurement. 

Accordingly, the Hinckley experiment was 
duplicated using attitudes toward Negroes, but 
with means of insuring that the Negro and 
pro-Negro white judges were not selected in 
any way which might exclude those with the 


1 Italics ours. 


strongest attitudes on the issue, as we thought 
possible with the Hinckley procedure of elimi- 
nating individuals who placed 30 or more 
items in any one category. Moreover, checks 
were incorporated to evaluate his inference 
that such individuals were “careless.” 

On the assumption that the set of 114 state- 
ments used by Hinckley represents a wide range 
of positions, with some clearly pro and others 
clearly con, and that a large number are of a 
neutral character, a favorable situation exists 
for studying how internal factors such as mo- 
tives and attitudes affect the placement of the 
rather unstructured and ambiguous middle- 
position statements. The literature on judg- 
ment and perception cited earlier clearly 
suggests the general hypothesis that the position 
of a judge on an issue in which he has strong 
personal involvement will constitute an "an- 
chor” for his judgments, whether or not his 
attitude is specifically called for in the instruc- 
tions. The specific hypothesis for the present 
study may be stated in summary form as 
follows?: 


1, Judges with extremely pro or con atti- 
tudes will show a tendency to concentrate their 
placement of items into a small number of 
categories. 

2. Judges with an extreme position and 
strong personal involvement will be highly 
discriminating in accepting items at their own 
end of the scale. They will correspondingly 
display a strong tendency to lump together 
statements at the end of the scale which they 
reject. The former tendency can be described 
as a raised threshold of acceptance and the 
latter as a lowered threshold for rejection. 

3. A greater degree of displacement will oc- 
cur for the "neutral" items and a smaller de- 
gree for the sharply defined pro and con 
statements at the extremes. 


In less technical terminology these hypotheses 
state that individuals with strong personal in- 


2 These hypotheses are formulated in relation to 
the placement of items in categories along a scale 
prescribed by the experimenter. Such scales and 
categories may be referred to as "imposed" scales 
and "imposed" categories. We have also extended 
these hypotheses by allowing each $ to define his 
own scale by choosing the number of categories he 
personally considered necessary to distinguish the 
different positions in the issue, Results using these 
methods, obtained in conjunction with the present 
experiment, will be reported in a separate paper. 

Studies in addition to those already cited which 
bear on our hypotheses include 1, 4, 11, 15 and 17. 
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volvement will tend to see issues pretty much 
“in all black and white” rather than with fine 
distinctions, and that statements even mildly 
critical of their position will be judged to be 
more hostile by them than by more neutral 
individuals. 


METHOD 


Procedure 


The problem presented above requires a 
design satisfying the following: (a) a topic 
which has strong personal involvement for Ss 
chosen to represent the pro or con stand on 
the issue, (b) clear differentiation of Ss with 
respect to their position on the issue, (c) admin- 
istration of the judging task without giving 
any hints that an attitude study is involved, 
and (d) checks to determine whether heavy con- 
centration of statements in any category is due 
to “carelessness.” For these reasons, the Atti- 
tude Toward Negro scale was considered to 
be the most appropriate topic, and through 
the kindness of Professor Hinckley we were 
able to secure the original 114 statements he 
used concerning the social position of Negroes. 
These statements had been selected to include 
the entire range from very favorable to very 
unfavorable on the issue and to be distributed 
in a fairly representative way over the range. 

The prime consideration in each step of the 
procedure was that all means be taken to in- 
sure careful and uniform administration to 
relatively small groups of Ss in each session 
and to be certain that Ss understood the in- 
structions at each point. In every case the 
experiment was administered by graduate 
assistants who had been thoroughly briefed 
concerning the procedure, who had themselves 
carried out the instructions under close super- 
vision, and who had in most cases practiced 
the procedure on a trial group. A special point 
was made to select administrators who would 
appear to Ss to be “one of them.” The groups 
were kept small in number whenever possible, 
in order that administrators could easily but 
unobtrusively observe Ss while instructions 
were being carried out. On the whole the sort- 
ing was administered in large rooms, chiefly in 
laboratory space, in order to give each $ ample 
room to spread out the statements before him. 

Categorization of items. The first task for the 
Ss consisted of sorting the statements into piles. 
Each of the 114 statements was mimeographed 
on a $x5 in. card. To facilitate later tabu- 


lation, an arbitrary code number unrelated to 
the content of the item was assigned to each 
statement and stamped on the back of the card. 
No S asked a question concerning any possible 
connection between the statement and the code 
number. As a further precaution, the cards 
were shuffled beforehand so that $ would not 
be able to get an idea about the exact number 
of statements. It was thought that the knowl- 
edge of the exact number of statements might 
tempt him to divide the total number of cards 
by the number of categories to give him a fixed 
guide for the number to be assigned to each 
category. 

Deliberately, no mention of attitude mea- 
surement was introduced in the instructions 
until the sorting phase was completed. It was 
only after the sorting was completed by $ that 
instructions aiming to tap his attitude on the 
issue were introduced. To insure this, the in- 
structions pertaining to sorting and those re- 
quiring the checking of S's particular stand on 
the isue were put on different sheets, and 
those relating to the attitudinal aspect were 
not passed out until the sortings were com- 
pleted. This temporal sequence was employed 
in line with our basic hypothesis that, whether 
or not the attitude of Ss was explicitly activated, 
the 5s would reveal it in their behavior when 
faced with appropriate stimuli. 

After receiving the envelope containing the 
114 statements, Ss were given the appropriate 
instructions for sorting. The instructions for 
the actual procedure of sorting were identical 
with those used by Thurstone and Chave (19), 
Hinckley (8), and others. The Ss were given 3 
to 4 minutes to read the following instructions 
aloud: 


You are given a number of statements ex- 
pressing opinions in regard to the social posi- 
tion of Negroes. These cards are to be sorted 
into different piles. 

You will find it easier to sort them if you 
look over a number of cards, chosen at random, 
before you begin to sort. 

You are given eleven cards with roman num- 
bers on them: I, II, III, IV, V, VI, VII, 
VIII, IX, X, XI. Please arrange these before 
you in regular order. Under Card I, put those 
statements which are most unfavorable in re- 
gard to the social position of Negroes. Under 
Card XI, put those statements which are most 
favorable in regard to the social position of 
Negroes. Under each of the other 9 cards, be- 
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tween I and XI, put those statements which 
correspond to that step in the 11 piles.* 

This means that when you are through sort- 
ing you will have 11 piles of statements ar- 
ranged in order from I, the lowest, to XI, the 
highest. 

Use your judgment as to where each state- 
ment should be placed in the 11 piles. Do not 
be concerned about the number of cards in 
each pile. 

When you are through sorting, please put a 
rubber band around each of the 11 piles of 
cards, placing the numbered card on top of 
each pile. 

If you complete the sorting before others, 
please remain quietly in your seat until final 
announcements are made by the experimenter. 


After the appropriate instructions for sorting 
were read aloud, the administrator answered 
any questions concerning the sorting to be sure 
that instructions were understood. Adminis- 
trators were, however, cautioned not to use 
examples relating to Negroes to clarify the 
instructions. For example, as an instance of a 
very unfavorable statement on an issue, admin- 
istrators used “Organized religion is the great- 
est single detriment to the advancement of 
civilization in the world today”; or, as an exam- 
ple of a very favorable statement on an issue, 
“Organized religion is the only hope for the 
salvation of the world.” 

When the instructions for sorting were clear 
to the Ss, the sorting began. During this time, 
the administrator distributed rubber bands to 
each $ and performed other tasks so that he 
would be in a position to see that each S was 
conscientiously performing the task according 
to instructions, without seeming to be observing 
the placement of specific cards. The sorting 
process consumed about 55 minutes. The Ss 
were reminded to remain in their seats and 


3 Probably for this issue, it would have been pref- 
erable to use I as the favorable and XI as the un- 
favorable end. The few sortings done by Negro Ss 
under individual supervision showed a tendency 
to switch I to the favorable end unless reminded 
that I represented the unfavorable end. Therefore, 
special care was taken in the experimental sessions 
(especially with Negro Ss) to repeat several times 
that I was the unfavorable, XI the favorable end 
of the scale. To provide maximum comparability 
we conformed to Hinckley's use of I as the most 
unfavorable end. A small number of Ss consistently 
reversed the two extremes and these data were re- 
copied in the standard direction. 


keep their cards with them until all Ss had 
finished sorting. 

Obtaining the judge's own attitudes. The 
second instruction sheet, which concerned the 
Ss’ agreement and disagreement with the piles 
of statements they had sorted, was then dis- 
tributed. After sorting, the following instruc- 
tions were given for the purpose of tapping 
Ss’ attitudes on the issue: 


Now that you have completed the sorting of 
cards into piles, pick up that pile of cards 
which comes closest to your view on the issue 
(that is, the social position of Negroes). On the 
numbered card on top of that pile, please write 
the word "agree." After writing the word 
"agree" on this pile, please write one of the 
following to indicate the degree of agreement 
with that pile: (a) "very strongly" (b) 
"strongly," or (c) "mildly." 

Now pick up that pile of cards which is most 
objectionable from your point of view. On the 
numbered card on top of that pile, please 
write the word "disagree." After writing the 
word "disagree" on.this pile, please write one 
of the following to indicate the degree of dis- 
agreement with that pile: (a) "very strongly," 
(b) "strongly," or (c) "mildly." 

When you have completed the above, please 
replace the 11 piles of cards, each carefully 
separated by its rubber band, in the envelope. 

This is part of a scientific research project. 
The results of each person's sorting will be 
treated as scientific data and will be confi- 
dential. 


Check on the “carelessness” hypothesis. The 
chief purpose of the next procedural step was 
to have the Ss estimate the proportion of state- 
ments which they judged to be “very unfavor- 
able,” “unfavorable,” "neutral" "favorable," 
and “very favorable” to Negroes to determine 
whether or not there would be general corre- 
spondence between actual proportions of cards 
sorted into various piles and the Ss' estimates 
of these proportions when the cards were out 
of sight. This step is critical for the problem 
at hand. If there is a general correspondence 
between the patterns of these percentages filled 
in by $ without the actual sight of the objec- 
tive piles he has made and the actual propor- 
tions into which he sorted the cards, the 
explanation for disproportionately large piles 
(of 30 or more statements) as due to "careless- 
ness” appears inadequate. Correspondence be- 
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tween the actual proportions in the piles and 
the percentages filled in by Ss at a later time 
would indicate a consistency in evaluation of 
the stimulus material at two time intervals 
which would not be expected if the sorting 
were casual and random. 

The estimates of sortings were obtained after 
all Ss had replaced the statements in their 
envelopes. The administrator passed out a 
3 x5 in. card to each S individually so that 
he could check to be sure that the sorted piles 
were in the envelopes. The administrator ex- 
plained the task orally as follows: 


Now that you have sorted the statements into 
piles in terms of their stand regarding Negroes, 
please estimate the percentage of statements 
which in your judgment were “very unfavor- 
able" to Negroes, “unfavorable” to Negroes, 
“neutral,” “favorable” to Negroes, and “very 
favorable” to Negroes. Write the per cent for 
each of these stands printed on the card from 
“very unfavorable” to “very favorable” in the 
appropriate space under the column headed 
“% of statements regarding Negroes.” The five 
percentages which you write should add up to 
100%. You will note that there is a question 
at the bottom of the card to be answered on 
the back. I believe the other items are self- 
explanatory. 


Supplementary attitude measurement with 
white judges. It was important to have a clearer 
differentiation of groups than that afforded by 
the rather crude list of six statements used for 
this purpose by Hinckley. In the case of Negro 
Ss, the intensity of agreement and disagree- 
ment with the piles of statements they had 
sorted and the obvious attitude of Negro Ss 
sufficed. But for the white Ss, a further check 
on their attitudes was obtained by administra- 
tion of the Likert Negro Scale* as the final 
step in the session. The scale was not admin- 
istered to Negro Ss because discussion with 
Negro pretest Ss revealed a strong animosity 
toward filling out a form prepared entirely 
from the white point of view. In fact, some 
Negro Ss spontaneously remarked that the 
statements to be sorted were prepared from 
the white standpoint. 

Subjects 

The final procedure presented above was 

decided upon after pretesting on both white 


4 See Likert, R. A technique for the measurement 
of attitudes. Arch. Psychol., N. Y., 1932, No. 140. 


and Negro Ss (N = 100). Ten Ss made the sort- 
ings individually under close supervision with 
the view of getting detailed hints for the 
procedure, 

The main concern in the selection of the 
final experimental Ss was to insure groups of 
Ss who would represent differentiated segments 
on the scale ranging from an extremely favor- 
able position to an extremely unfavorable posi- 
tion in regard to the issue of the social position 
of Negroes. For this purpose, groups and insti- 
tutions were selected which would insure a like- 
lihood of obtaining such differentiated positions 
on the scale. 

Negro groups. It followed that Negro Ss 
should be used to represent the pro end of 
the scale. A total of 103 were used. Fifty-four 
were enrolled as graduate students or were at- 
tending the summer session at the University 
of Oklahoma, These more mature Negroes may 
be taken as the most self-consciously pro group 
in this study. Since these individuals were 
attending the University in the year following 
the Supreme Court decision ending segregation 
at the University, they had a sense of mission 
and an accentuated identification as members 
of their group. Many were brought to the uni- 
versity by the National Association for the 
Advancement of Colored People, which has 
encouraged Negroes on the campus to form ac- 
tive social groups. As a group, these students 
had had more actual contact with discrimina- 
tion practices than the younger Negro Ss. 

The younger group of Negro Ss was made up 
of 49 undergraduates at a state university for 
Negroes; hence, their age and educational level 
are lower than that of the University of Okla- 
homa group. While segregation and awareness 
of anti-Negro feeling had been a part of their 
life, they had grown up largely in a Negro 
environment. Since this university is also lo- 
cated in a Negro community, this relatively 
“sheltered” condition is continued in academic 
life. 

White groups: 1. Pro-Negro. Because validity 
derived from the stands taken in actual life 
was the desired criterion governing selection of 
subject groups, two small groups of white Ss 
who had actually participated in pro-Negro 
activities were chosen. Members of these groups, 
totaling 19, had been among the leaders in 
local campaigning at the University of Okla- 
homa (by student polls, newspaper work, etc.) 
for an end to segregation. One group was se- 
lected by the director of the YMCA as among 
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those individuals who were thus active. Another 
group had been organized around this issue. 
Thus, for these very pro-Negro white Ss, per- 
sonal information as well as details of the 
stand of their groups were available. 

2. “Average” subjects. Here results from a 
number of different colleges throughout the 
South were secured. These included two in an 
area in southeastern Oklahoma popularly 
known as “Little Dixie.” In general, students 
attending these schools are from a lower socio- 
economic level than those attending the state 
university. The educational level of the area 
is relatively low. Affiliation with fundamentalist 
religious groups is common. The students in 
these colleges are, for the most part, preparing 
to become teachers. From one, 53 Ss were 
chosen from classes in elementary English. 
From the other, 58 students who had not been 
exposed to courses in either psychology or soci- 
ology, were used. 

Three colleges and universities from the state 
of Georgia were also secured. Sixty-four Ss 
were obtained, distributed about equally 
through the three institutions, In all, 194 white 
judges were used. 

3. Anti-Negro subjects. It was hoped to secure 
a group of strongly anti-Negro Ss. The major 
difficulty in finding strongly anti-Negro Ss lay 
in the fact that no groups organized strictly 
around such a stand were available. Further, 
pretesting revealed a strong tendency on the 
part of college students to express more liberal 
attitudes than would probably have been ex- 
pressed at the time of the original Hinckley 
study. A number of such Ss who were believed 
to be somewhat less liberal on this issue were 
obtained from living units at the University 
of Oklahoma. But it finally appeared necessary 
to use the Ss in the total white sample who 
expressed the most anti-Negro attitudes on the 
evaluation cards and on the Likert question- 
naires. 


RESULTS 


Categorization of Statements 
by Various Groups 


One method of analyzing any differences 
between the major groups of Ss in the degree 
to which items are displaced is to determine 
the number of statements placed in each of 
the eleven categories. Results on these distri- 
butions are shown in Figure 1 for (a) Negro 


Ss, (b) strongly pro-Negro white Ss (leaders in 
the anti-segregation movement), and (c) “aver- 
age” white Ss from six Southern colleges. To 
permit comparison of the results for the last 
group with those obtained by Hinckley, his 
procedure of eliminating all individuals plac- 
ing 30 or more statements into any single 
category was also studied. Data are also pre- 
sented for a group of 17 Ss who appeared to 
have the most anti-Negro attitudes within the 
white group. These 17 Ss had indicated dis- 
agreement with the pile of statements most 
favorable to Negroes and had markedly anti- 
Negro Likert scale scores (below 50). 

It will be observed that the Negro group 
shows a heavy piling up of statements in the 
extreme categories, indicating displacement of 
neutral statements to the extreme position. 
This displacement is particularly noticeable for 
category I at the anti-Negro end, a position 
with which these Ss strongly disagree. The dis- 
tribution of responses of the pro-Negro white 
group is similar to that for Negroes but a little 
less extreme. The dotted line at 30 indicates 
the frequency which Hinckley used to eliminate 
Ss, Over three-fourths of the Negro Ss and two- 
thirds of the pro-Negro white Ss would be 
eliminated by applying the Hinckley criterion. 

The data for the average white group show 
considerably less piling up at the ends. When 
the Hinckley elimination procedure is fol- 
lowed the distribution is relatively flat. The 
differences in the distributions for the Negro 
and average white are highly significant statis- 
tically. 

The results for the anti-Negro Ss are not 
strikingly different from those for the average 
white Ss, But it will be noted that as compared 
with the white 5s they tend to show à slightly 
greater tendency to concentrate items at the 
positive end of the scale. 


Scale Values for the Various 
Criterion Subgroups 

Another method of studying displacement of 
items is to investigate the distributions of scale 
values obtained for the various subgroups. 
These are the data: on which Thurstone, 
Hinckley, and others base their main case for 
the validity of their scaling procedure. Inspec- 
tion of the original 114 items reveals that all 
of the statements are not equally appropriate 
for white and Negro Ss. For example, items 
like “I would not patronize a hotel that ac- 
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"Average" white subjects 


Number of statements placed in category 


I IH WIV V VI VII VIIIIX X XI 


50 


Pro-Negro white subjects 


I H WIV V VI VIIVIIIX X XI 


White subjects 
(using Hinckley criterion 
for elimination) 


0 
I HE WIV V VI VII VIIIX X XI I I II IV V VI VII VIIIX X XI 


Anti-Negro white subjects 


I H IH IV V VI VIL VILIX X XI 


Figure 1. Number of statements placed in each of 11 categories by each group 


of judges. 


commodates Negroes” or “My lack of contact 
with the Negro makes it impossible for me to 
pass judgment as to his social position" are not 
as relevant to Negro Ss as to whites. But it was 
possible to find 11 items which seemed equally 
appropriate for the Negro and white popula- 
tions, and which were approximately equi- 
distant in the original Hinckley scaling. Scale 
values were then found for each of these items 
for each of the present groups, using the con- 
ventional Thurstone procedure. The eleven 
items were all used in the final forms of the 


Scale of Attitude toward the Negro.5 They are 
numbered 5, 6, 10, and 16 in Scale A, and 1, 
2, 3, 5, 8, 9, and 15 in Scale B. 

The distributions of scale values for these 
eleven items derived from the responses of the 
various groups are presented graphically in 
Figure 2. For comparison, the scale values re- 
ported by Hinckley are given. It will be seen 


5 Hinckley, E. D. Attitude toward the Negro, Scale 
No. 3, Forms A and B. (L. L. Thurstone, Ed.) 
Chicago: Univer. of Chicago Press, 1930. 
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Figure 2. Scale values of 11 selected items for each of the four major groups. 


that there is considerable similarity in the scale 
values for the average white Ss (with the 
Hinckley procedure of eliminating those with 
30 or more statements in any category) and 
the scale. values originally found by Hinckley. 
The correlation between the two sets of scale 
values is .96 (11 items). But the values for the 
strongly pro-Negro white judges and for the 
Negro judges do not correspond at all to those 
obtained by Hinckley. The distributions of the 
Negro groups show the majority of the items 
bunched up at the extreme anti-Negro end of 
the scale (in line with the results of Figure 1). 
The items which are neutral for Hinckley's 
group and for our "average" white Ss are dis- 
placed by the Negro judges to the extremes, 
principally to the anti-Negro end. The distri- 
bution of scale values for the pro-Negro white 
Ss is similar to that for the Negro Ss but again 
not quite as extreme. The results for the anti- 
Negro Ss also show a tendency for neutral 
items to be displaced toward the extremes, but 
in their case some displacement is also notice- 
able toward the pro-Negro end. 


“Carelessness” as an Explanation 
for the Results 

It will be recalled that Ss who placed 30 or 
more statements in any single pile were dis- 
carded by Hinckley on the grounds of careless- 
ness. To obtain a check on this explanation 
for the large number of our $s who placed this 
number of statements in the extreme position 
piles, after the sorting was completed and the 
cards placed in their envelope, a question was 


asked concerning the Ss' impressions of the 
percentage of the statements which were "very 
unfavorable,” “unfavorable,” “neutral,” “fairly 
favorable,” and “very favorable” to the Negro. 
From these data we have evidence that the Ss 
who placed a large number of their cards in 
the extremely unfavorable categories may 
really have perceived the statements in that 
way and did not place them there through 
carelessness. The data are presented in Table 1 


TABLE 1. Percentages of Statements Con- 
sidered “Very Unfavorable” or “Unfavorable” 
to Negroes by Individuals Placing Varying 
Numbers of Statements in Category I 
(Anti-Negro) 

Percentages Making Various Estimates 


of Unfavorableness of Items among 
Individuals Who Placed 


Per Cent of Less 60 or 
Items Judged than 30 30 to 59 More 
“Very Un- Items in Items in Items in 
favorable” or Category I Category I Category I 
“Unfavorable” (N = 119 (N = 105) (N = 34) 


60 or over 


40-59 55.5 39.0 17.7 
Less than 40 21.8 9.5 2.9 
x? = 43.3 
p = «001 


to show the difference in the evaluations of 
the number of unfavorable statements in the 
list among Ss who placed less than 30 state- 
ments at the anti-Negro end of the scale, 
among those who placed between 30 and 59, 
and among those who placed 60 or more in 
this category. The Ss who placed a greater num- 
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ber of statements in the anti-Negro category 
were definitely more inclined to believe that 
there were a high number of unfavorable state- 
ments in the list. The chi-square value for the 
distributions is 43.3, which gives a probability 
value of considerably less than .001. A very 
high degree of relationship is not to be ex- 
pected in view of the fact that Ss had not been 
given any prior intimation that they would be 
required to make such a judgment. For the 
entire group of 258 Ss on whom the relevant 
information was available, a correlation of .32 
was obtained between the percentage of state- 
ments considered "very unfavorable” and “un- 
favorable" and the number of cards placed in 
the extreme anti-Negro category. This corre- 
lation is significantly different from zero at con- 
siderably less than the .001 level. These data 
suggest that Ss who put a large number of 
statements into the extreme category do so not 
because of carelessness but because they actually 
perceive the statements differently. 


DISCUSSION 


The results just presented clearly support the 
three principal hypotheses investigated. Even 
though judges are not asked to state their own 
attitudes, the position of the judge on the issue 
does influence in a significant way his cate- 
gorization of the items. The displacements are 
away from the individual's own position, sup- 
porting the notion that those with extreme 
positions are more selective in the statements 
they will accept at their own end of the scale 
(their "threshold of acceptance" is raised), and 
more inclined to reject items which are at 
variance with their position (their "threshold 
of rejection" is lowered). The degree of dis- 
placement is greater for the items in the middle 
of the scale, in line with the hypothesis that 
“neutral” items are more ambiguous? and less 
well structured, and hence are more readily 
subject to the personal interpretations of the 
judges. 


€ Edwards (5) has called attention to "lower Q 
values at both extremes of the scale and higher Q 
values toward the center of the scale" (p. 163). On 
the basis of this tendency Edwards concluded 
"Clearly, then, we have evidence to indicate that 
the scale and Q values of items sorted by the 
method of equal-appearing-intervals are not inde- 
pendent, that "neutral" items are relatively more 
ambiguous than items at either extreme of the 
continuum." (Italics ours.) 


The present data do not indicate whether 
the distortions and displacements are caused 
by the fact that individuals with extreme posi- 
tions lack the ability to discriminate between 
adjacent items at the opposite extreme, and 
hence place them in the same category, or 
whether they reflect variations in the judge's 
interpretation of the total scale and the mag- 
nitude of category intervals. On the surface, 
the second alternative is supported by the study 
of Ferguson (6) who found no differences in 
the distributions of paired comparisons for 
judges with differing attitudes toward war. But 
in the light of the present results a question 
is again raised as to the magnitude of differ- 
ences in attitudes of his contrasted groups of 
judges and in the degree of their involvement 
in the issue. During the thirties, when the 
study was done, it is likely that variation in 
attitude toward war may not have been very 
pronounced, and Ferguson himself states that 
it was impossible "to secure any groups favor- 
able toward war" (p. 116). The data of the 
present experiment suggest that it is quite 
likely that the effects are specific to methods 
of absolute scaling (the second alternative), 
since the rank orders of the items for the pro 
and con judges are very similar. An experiment 
is currently underway specifically on this prob- 
lem, in which the Attitude toward the Negro 
issue is again used with Negro Ss in an at- 
tempt to see whether the items which are 
bunched together with the equalappearing 
interval method are clearly differentiated with 
the method of paired comparison. 

On the basis of the present study it appears 
quite likely that the startling difference between 
the present results and those of Hinckley is 
due to his elimination of the extreme cases. 
When his elimination procedure is followed 
and the sample restricted to white Ss, there is 
a close correspondence in results even though 
20 years have elapsed since his original study. 

Evidence against the explanation of the 
results as due to “carelessness” comes from sev- 
eral sources. The data already presented indi- 
cate that those who placed a large number of 
the statements in the extreme anti categories 
judged at a later time that more of the items 
had been unfavorable to Negroes. Further evi- 
dence comes from another study done in con- 
junction with the present one in which Ss were 
allowed to use as many or as few categories 
for sorting as they deemed necessary to distin- 
guish among the items on the issue. The Ss 
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who placed a large number of items in the 
extreme categories under the present instruc- 
tions used fewer categories to express shades 
of opinion and again placed a large number 
in the extreme categories. A different type of 
evidence is the lawful nature of the phenom- 
enon. It is the very subgroups with extreme 
positions (determined either by their own atti- 
tude statements or by their assignment to a 
criterion group) who place the large number 
at the extremes, and the items they displace 
are those in the middle of the scale (the more 
ambiguous items). 

Our results do, as Thurstone (19) had clearly 
anticipated, raise serious questions as to the 
use of the method of equal-appearing intervals 
on issues where strong attitudes are involved. 
Further research is needed to determine 
whether under these conditions a single scale 
is involved for the different groups of judges 
or whether distinctive scales will be required. 
A Guttman or Lazarsfeld type of analysis will 
be required to answer this question (18). Even 
if a single scale is involved, great care will be 
required in the selection of judges, since their 
position may influence the scale values ob- 
tained. Obtaining an unbiased sampling of 
positions may constitute a difficult practical 
problem, since on many issues attitudes are 
related to educational and socioeconomic status. 
It may therefore be very difficult to secure 
representatives of all positions among those 
who are well enough educated and have suf- 
ficient sophistication to carry out the difficult 
task of categorizing the items. 

From the present results there emerges an 
interesting possibility for developing a be- 
havioral “projective” method of attitude 
assessment through study of the way an indi- 
vidual sorts statements on an issue. If the ten- 
dencies found in the present experiment for 
individuals with extreme positions to bunch up 
the statements at the extremes are found for 
other issues, it may be possible to assess the 
attitude of an individual without ever asking 
him his opinion but by relying entirely on the 
way he distributes his judgments. Individuals 
with more or less neutral attitudes would be 
expected to space their judgments rather evenly 
over the entire range, those at the pro end 
would tend to reject neutral items and hence 
pile them up at the anti end, and those with 
anti attitudes would place them at the opposite 
end of the scale. As indicated previously, one 
may even carry this technique a step further by 


not using a fixed number of categories, but by 
deliberately letting the individual establish a 
scale of his own with whatever number of cate- 
gories he sees fit. Then the extension or con- 
striction of the scale he establishes, the number 
of categories he uses, the crowding or neglect- 
ing of certain categories, and the direction of 
concentration of items may all provide useful 
indices for analysis of the way the individual 
"perceives" the issue. In such a technique one 
would obviously want to insure the inclusion 
of a large number of ambiguous and unstruc- 
tured items since these will be the ones which 
are more conducive to displacement. In another 
study done in conjunction with the present ex- 
periment, investigation has been carried out 
along these lines and the results will be reported 
in a forthcoming paper. 


SUMMARY 


l. Investigation was made of the apparent 
discrepancy between studies in the field of judg- 
ment, where the position of the individual on 
an issue affects the nature of his judgments, 
and those in the field of attitude scaling, where 
it has been reported that the scaling of state- 
ments is independent of the judge's stand on 
the issue. 

One possible hypothesis for the discrepancy, 
based on an analysis of the original studies, was 
that prior experiments had not employed Ss 
with a sufficiently wide range of attitudes to 
represent adequately the strongly involved $s 
who would be most likely to show the distor- 
tions and displacements typically obtained in 
the field of perception and judgment. Accord- 
ingly, every effort was made to secure samples 
of $s who were deeply involved in the issue 
about which the items to be judged were writ- 
ten. Attitudes toward the social position of the 
Negro furnished an issue on which strong per- 
sonal involvement could be secured, particu- 
larly on the part of Negro 5s. The materials to 
be judged were the 114 statements originally 
used by Hinckley (8), and his instructions and 
procedures were duplicated. In addition to Ne- 
gro judges, groups of white Ss with known pro- 
Negro attitudes, "average" white Ss, and a small 
group of anti-Negro whites, were employed as 
judges. 

2. It was found that Negro 5s and militantly 
pro-Negro white Ss tended to place a dispropor- 
tionate number of statements in the extreme 
categories, in line with the predictions from 
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judgment theory that individuals with strong 
attitudes tend to see issues in “black and white” 
and displace neutral statements to the extremes. 
The numbers of statements assigned to the var- 
ious categories by "average" white Ss are much 
more uniform. 

3. Scale values for 11 statements spaced 
rather evenly over the entire scale in Hinckley's 
study were compared for each of the major 
groups. The distributions of scale values for the 
Negro and pro-Negro white Ss were compressed 
at the anti-Negro end of the scale, with a small 
number at the other extreme. Those for the 
“average” white Ss were spaced more evenly 
throughout the range in a manner closely ap- 
proximating the distribution found by Hinck- 
ley. Anti-Negro Ss tended to displace neutral 
statements to the extremes, particularly toward 
the pro-Negro end. 

4. The results support the notion that part 
of the discrepancy in results between the pres- 
ent study and those of previous experimenters 
may be attributable to the representation of a 
greater range of attitudes. In the experiment 
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27. A Comparison of the Thurstone and Likert Techniques 
of Attitude Scale Construction 


ALLEN L. EDWARDS and KATHRYN CLAIRE KENNEY 


This is a study in the methodology of attitude 
measurement; a comparison and evaluation of 
two methods of attitude scale construction. Al- 
though various techniques for the measurement 
of social attitudes have been suggested,! the two 
most frequently used methods are probably the 
“method of equal appearing intervals” devel- 
oped by Thurstone and Chave (18) and the 
“method of summated ratings”? developed by 
Likert (11). This study is concerned with the 
relative merits of the Thurstone and Likert 
techniques of scale construction. 


METHOD OF EQUAL APPEARING 
INTERVALS 


The method of equal appearing intervals 
begins with the collection of a variety of state- 
ments of opinion toward a particular issue 
which are then screened and edited in accord- 
ance with certain “informal criteria.” State- 
ments which appear to represent past rather 
than present attitudes are discarded or re- 
worded, as are statements which appear to be 
double-barreled? or which contain confusing or 
ambiguous concepts. Inspection should also ex- 


* Reprinted from Journal of Applied Psychology, 
1946, 30, 72-83, with permission of the senior author 
and the American Psychological Association. 

1 Excellent summaries can be found in Albig (I, 
pp. 181-218), Bird (3, pp. 149-167), LaPiere and 
Farnsworth (10, pp. 397-399), and Murphy, Murphy, 
and Newcomb (13, pp. 891-912). 

2 The term "method of summated ratings" was 
introduced by Bird (8, p. 159) to describe the pro- 
cedure followed by Likert. 

3 But in this connection see the discussion by 
Edwards (4, p. 578) which points to the possible 
value of statements which contain a "rationaliza- 
tion" clause. 


clude statements which might be approved by 
individuals with opposed attitudes.* 

After the statements have been edited, they 
are then presented to a group of judges who 
are instructed to sort them into various cate- 
gories to represent a scale ranging from ex- 
tremely favorable, through neutral, to extremely 
unfavorable expressions of opinion about the 
issue or institution in question. The judges are 
not asked to give their own opinions, but 
merely to estimate the degree of favorableness 
or unfavorableness expressed by each statement. 
When the sorting procedure is completed, tab- 
ulations are made indicating the number of 
judges who placed each item in each category. 
From these data accumulative proportions are 
computed for each item and ogives are con- 
structed. Scale values of the individual items 
are then read from the ogives, the value of each 
item being that point along the base line, in 
terms of scale value units, above and below 
which 50 per cent of the judges placed the item. 

A statistical criterion of the ambiguity of the 
items is provided in terms of the width of the 
range between the points on the scale marking 
off the 25th and 75th percentiles. This distance 
is called the Q value. A small Q value indicates 
that the middle 50 per cent of the judgments 
spread over a relatively small range or, in other 
words, that there is a good deal of agreement 
among the judges as to where the item belongs 
on the scale. A large Q value indicates lack of 
agreement among the judges and, indirectly, 
that something is probably wrong with the 
wording of the statement. 

Items are selected for the final scale on the 


4 A more detailed enumeration of the rules to be 
followed can be found in the monograph by 
Thurstone and Chave (18, pp. 56-58). 
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basis of the computed scale and Q values. An 
attempt is made to select about 20 or 22 items 
with low Q values and with scale values falling 
at relatively equally-spaced distances along the 
continuum. Two comparable forms of the scale, 
in terms of scale and Q values of the items in- 
cluded in each form, are constructed. The two 
forms are then given to a new group of subjects 
who are asked to check those statements with 
which they agree. The score for the individual 
subject is the mean or median scale value of the 
items which he has checked as being those with 
which he agrees. Reliability of the scales is 
found by correlating scores on the two forms 
of the scale. 


METHOD OF SUMMATED RATINGS 


The method of summated ratings also calls 
for a collection of various statements of opinion 
which are then edited in accordance with in- 
formal criteria similar to those used in the 
method of equal appearing intervals.5 After 
the elimination and editing of items failing to 
meet the prescribed standards, the remaining 
statements are presented to a group of subjects 
who are asked to respond to each one in terms 
of their own agreement or disagreement with 
the statement. Usually a | to 5 scale of response 
is used; subjects check whether they strongly 
agree, agree, are undecided, disagree, or strongly 
disagree with each statement. A score is given 
for each item depending upon the response 
made, The five possible responses may be 
weighted 1-2-3-4-5 or 5-4-3-2-1.6 Either 1 or 5 is 
consistently favorable or unfavorable, although 
the continuum is reversed in about half the 
Statements, That is, about half the statements 
are worded so that a strongly agree response in- 
dicates a favorable reaction to the issue in ques- 
tion, while the other half of the statements are 
worded so that a strongly agree response indi- 
cates an unfavorable reaction. The score for the 
individual subject is the sum of all scores for 
the separate items. 

In selecting items for the final scale, a cri- 
terion of internal consistency is used. Criterion 
groups consisting of the upper and lower 10 (or 
some other) per cent of the subjects in terms of 
total scores are compared to find whether the 


5 See Murphy and Likert (12, pp. 281-283) and 
Rundquist and Sletto (16, pp. 6-8) for a discussion 
of these standards. 

* This is a simplified method of scoring which was 
found to correlate 99 with the more complicated 
sigma method first used. 


individual items will differentiate between the 
two groups. The means of the upper and lower 
groups for each item are found; items which 
show the largest difference between the means 
of the two groups are retained in the final scale. 
Scales constructed by the method of sum- 
mated ratings usually contain about 20 to 25 
items, although Hall (8) has used scales with as 
few as 5, 7, and 10 items. Reliability of the 
scales is found by the split-half method of cor- 
relating scores for the odd versus even items. 


CRITICISMS OF THURSTONE’S METHOD 


The monograph by Thurstone and Chave, 
describing in detail the method of equal ap- 
pearing intervals for measuring attitudes, ap- 
peared in 1929. By the time Likert's monograph 
describing his technique appeared in 1932, the 
Thurstone procedure was generally recognized 
as a major, if not the most important, develop- 
ment in the field of attitude scale construction. 
It is important, therefore, if we are to compare 
the two methods, to examine the motivation 
behind Likerts departure from the by then 
already well-established Thurstone technique. 
Some indication of this is given by the follow- 
ing quotation from Murphy and Likert:? 


A number of statistical assumptions are made 
in the application of his (Thurstone's) attitude 
scales—e.g., that the scale values of the state- 
ments are independent of the attitude distribu- 
tion of the readers who sort the statements— 
assumptions which, as Thurstone points out, 
cannot always be verified. The method is, more- 
over, exceedingly laborious. It seems legitimate 
to inquire whether it actually does its work 
better than the simpler scales which may be 
employed, and in the same breath to ask also 
whether it is not possible to construct equally 
reliable scales without making unnecessary sta- 
tistical assumptions (12, p. 26).8 


7 We quote from Murphy and Likert rather than 
from Likert's original report because the Murphy 
and Likert publication is probably more readily 
available to the interested reader and since it con- 
tains, with but few corrections or omissions, the 
material originally reported by Likert and, in addi- 
tion, a more detailed report of applications of 
scales constructed by the technique. The passage 
quoted, with but minor changes, is the same as 
that appearing in Likert (11, p. 6) 

* The method of summated ratings also makes 
certain statistical assumptions as Murphy and 
Likert recognize (12, pp. 26 ff). Cf. also the paper 
by Ferguson (6). 


A COMPARISON OF THE THURSTONE AND LIKERT TECHNIQUES 251 


The main contentions of Murphy and Likert 
regarding the method of summated ratings seem 
essentially to be: (1) “it avoids the difficulties 
encountered when using a judging group to 
construct the scale” (12, p. 42); (2) “the con- 
struction of an attitude scale by the sigma 
method? is much easier than by using a judging 
group to place the statements in piles from 
which the scale values must be calculated” (12, 
p. 43); (3) “it yields reliabilities as high as those 
obtained by other techniques with fewer items” 
(12, pp. 42-43); (4) it gives results which are 
comparable to those obtained by the Thurstone 
technique.|? More generally, the method of 
summated ratings "seems to avoid many of the 
shortcomings of existing methods of attitude 
measurement, but at the same time retains most 
of the advantages present in methods now used" 
(12, p. 42). These claims, it should be noted, 
have been vigorously contested, notably by Bird 
(3) and Ferguson (7). For our part, we shall, in 
the sections which follow, attempt to evaluate 
them in the light of available evidence. 


INFLUENCE OF THE JUDGING GROUP 


Several studies cast doubt upon Murphy and 
Likert's criticism that the attitudes of the judg- 
ing group may influence the scale values of 
items when the method of equal appearing in- 
tervals is used. Using various approaches to the 
problem, these studies (5), (9), (15), seem to be 
in agreement that the attitude of the judging 
group is not a seriously disturbing factor. 
Hinckley's study (9), in particular, is clear cut. 
Groups of white students with differing attitudes 
toward the Negro were asked to sort items ex- 
pressing opinions about the Negro. A high pos- 
itive correlation was obtained between the scale 
values assigned to the items by the white stu- 
dents who were favorable and by those who 
were unfavorable in attitude toward the Negro. 
A high positive correlation was also obtained 
between scale values derived from judgments 
of an antagonistic white group and from the 
judgments of a group of Negroes. 


SIMPLICITY OF THE LIKERT METHOD 


Investigators who have used the Likert 
method seem to be in agreement that it is sim- 


® Later replaced by the even simpler 1 to 5 method. 
10 We have failed to find a specific statement to 
this effect, yet the idea seems implied in the para- 
graph quoted above. 


pler than the method of equal appearing in- 
tervals. Hall reports that he used the method 
of summated ratings in his survey of the atti- 
tudes of employed and unemployed men “be- 
cause of its relative simplicity and because it 
yields scales of high reliability” (8, p. 6), and 
Rundquist and Sletto, who used the Likert 
technique in constructing the Minnesota Survey 
of Opinions scales, agree that “it is less labori- 
ous than that developed by Thurstone” (16, 
p. 5). This evidence is, of course, in the nature 
of authoritative opinion, and Bird has raised 
some rather pertinent objections concerning it: 


Will the experimenter spend more time, too, in 
scoring every item and summating them in 
these long scales than another might spend de- 
termining the mean or median value by the 
Thurstone technique? Then too, is it actually 
less time-consuming to validate items in terms 
of selected groups than to determine the Q 
values from a curve or a distribution of scores? 
The claim of greater or lesser laboriousness 
seems to have been put forward without due 
regard for all processes in scaling techniques; 
but, in the interest of constructing refined mea- 
suring instruments, time can be neglected. 
There is much to be said in favor of a psychol- 
ogist’s refining his instrument before actually 
applying it to experimental groups. The argu- 
ment that the method of summated ratings is 
less laborious limps badly (3, p. 161). 


Bird's points are well taken, particularly in 
the case of scales constructed by the method of 
summated ratings which contain more than, let 
us say, 25 items. But most investigators who 
have used the method of summated ratings have 
not found any need for the "long scales" to 
which Bird is objecting. After our own experi- 
ence in constructing both Likert and Thurstone 
scales, we are inclined to agree with other in- 
vestigators that scales can be constructed by the 
method of summated ratings more quickly and 
with less labor than by the equal appearing in- 
terval method. We found, for example, that 
construction of the Thurstone scales required 
about twice as much time, exclusive of the time 
spent by the judging group in sorting the items, 
as did the Likert scale. It is unfortunate that 
this is but an estimate and that our records do 
not permit a more precise statement of the time 
factor—a point that should be checked in fu- 
ture research and reported. 
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A note of confusion has centered around the 
subject of reliability largely as a result of Li- 
kert's study of the reliability of a "Thurstone- 
type scale which was scored by both his and the 
Thurstone technique. The scale was given to a 
group of subjects with instructions to check the 
items in accordance with the usual Thurstone 
instructions. The same scale was then given to 
the subjects with instructions to check for each 
item one of the five alternatives (strongly agree, 
agree, undecided, disagree, strongly disagree) in 
accordance with the usual Likert instructions. 
Four of the items on the Thurstone scale were 
not adaptable to Likert-type responses and were 
omitted when the subjects were asked to check 
their reactions according to the method of sum- 
mated ratings scoring system. 

The reliability coefficient between the two 
forms of the scale (22 versus 22 items), when 
scored by the Thurstone method, was .88 (cor- 
rected). The reliability coefficient for the two 
forms (18 versus 18 items) as scored by the 
Likert method was .94 (corrected). What this 
demonstrates, of course, is that it is possible to 
take a scale constructed by the Thurstone tech- 
nique and to apply to most of the items the 
Likert method of scoring. But one critic seems 
to think that because of this finding, Likert 
erroneously concluded that “his technique is the 
better one" (7, p. 52). The higher reliability 
coefficient obtained by the Likert method of 
scoring, he adds, may be due to the fact that 
“increasing the number of steps in a psycho- 
logical scale increases reliability" (7, p. 52). As 
a matter of record, this is precisely the same 
explanation offered originally by Murphy and 
Likert (12, p. 55 and p. 47) for the higher 
reliability coefficient obtained by the 1 to 5 
method of scoring. The entire discussion, pro 
and con, on this point, it seems to us, has little 
bearing upon the question of whether the 
method of summated ratings or the method of 
equal appearing intervals will yield scales of 
higher reliability. The real problem concerns 
the reliabilities of scales constructed by the two 
methods, not the reliability of a particular scor- 
ing scheme ísolated from the technique of scale 
construction of which it is a part. And on this 
question there is ample evidence. 

Ferguson has quoted Thurstone as reporting 
the reliabilities of scales constructed by the 
method of equal appearing intervals, under his 
editorship, as being "all over .8, most of them 
being over .9" (6, p. 670). We do not know 


whether these coefficients are for scales of 20 or 
40 items, but Ferguson mentions that in his 
own studies he has found reliabilities for Thur- 
stone scales ranging from “.52 to .80 for the 
20-item forms and from .68 to .89 for the 40- 
item forms" (6, p. 670). If we take these coeffi- 
cients as representative, how do they compare 
with those reported for scales constructed by 
the method of summated ratings? 

Murphy and Likert found reliability coeffi- 
cients for their Internationalism Scale of 24 
items ranging from .81 to .90.11 Their Imperi- 
alism Scale of 12 items gave coefficients ranging 
from .80 to .92; the Negro Scale of 14 items 
yielded coefficients ranging from .79 to .91 (12, 
p. 48). Rundquist and Sletto report coefficients 
ranging from .78 to .88 for various scales of 22 
items each (16, p. 110). 

That Likert-type scales with even fewer items 
will give high reliability coefficients is indicated 
by Hall. Reliability coefficients for his religious 
scale of 10 items ranged from .91 to .93; for the 
scale of 7 items measuring attitude toward em- 
ployers the coefficients ranged from .77 to .87; 
and the morale scale of 5 items gave coefficients 
of .69 to .84 (8, p. 19). All of these coefficients 
compare favorably with those obtained from 
scales constructed by the method of equal ap- 
pearing intervals. According to the evidence at 
hand, there is no longer any reason to doubt 
that scales constructed by the method of sum- 
mated ratings and containing fewer items will 
yield reliability coefficients as high as or higher 
than those obtained with scales constructed by 
the Thurstone method. 


THE NEED FOR A JUDGING GROUP 


The confusion which followed Likert's re- 
scoring of a Thurstone-type scale by the 1 to 5 
method, unfortunately, has not been confined 
to the subject of reliability; it has spread to 
involve the question of whether or not there is 
need for a judging group in the construction 
of attitude scales. Ferguson seems to believe 


11 The reliability coefficients of the Likert scales 
are based upon split-half correlations, and all of 
those ed here have been "corrected" to indi- 
cate the reliability of the test taken as a whole. 
We feel that, at least for purposes of comparison, 
it is not valid to raise the coefficients for the 
Thurstone scales which are based upon equivalent 
forms of 20 to 22 items each. To do so would 
indicate the reliability to be expected from a 
Thurstone scale of 40 to 44 items, while in practice 
the scales generally used contain only half these 
numbers. 
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that Murphy and Likert implied, as a result of 
obtaining a higher reliability coefficient with the 
1 to 5 method of scoring than with the custom- 
ary Thurstone method of scoring, that they had 
demonstrated that the method of summated 
ratings does away entirely with the need for a 
judging group. He argues against this and bases 
his criticism on the following grounds: “Since 
the statements (used by Murphy and Likert in 
the above study) had been sifted through the 
sorting procedure (Thurstone’s), it would seem 
unjustifiable to conclude that Likert’s method 
did away with the need for a judging group. 
To test this point adequately one should com- 
pare scales constructed (independently of the 
Thurstone method) by the Likert technique 
with those constructed by the equal appearing 
interval method” (7, p. 52). 

We are in complete agreement with this ar- 
gument; therefore we find ourselves at a loss to 
understand the following statement of experi- 
mental design appearing in the same article: 


A more adequate test can be provided by re- 
scaling items using Thurstone’s method in scales 
constructed by Likert's technique. If Likert's 
technique does away with the need for a judg- 
ing group, the two methods of treating the 
statements should give the same result (7, p. 52). 


But this particular experimental design will 
not give a test of the two methods of scale con- 
struction; it is an investigation of where Likert- 
selected items will fall along the continuum 
posited by Thurstone or, stated somewhat dif- 
ferently, what Thurstone scale values will be 
attached to the particular items included in a 
particular Likert-type scale. 

What Ferguson found by following this line 
of investigation was that Likertselected items, 
when scaled according to the method of equal 
appearing intervals, failed to spread evenly over 
the scale continuum of Thurstone; the state- 
ments failed to represent all degrees of attitude 
but fell largely at the favorable and unfavor- 
able ends of the scale with the middle cate- 
gories neglected. Only one of the Likert-type 
scales which Ferguson attempted to scale by the 
Thurstone technique, an economic conservatism 
scale, gave a fairly even spread of items, and 
the correlation between the Thurstone and 
Likert methods of scoring this scale was .70.1 


12 Assuming that the reliability coefficient of this 
scale when scored by the Thurstone method is 
approximately that obtained when the scale is 


Because of these findings, the failure of the 
Likert-selected items to spread evenly over the 
Thurstone continuum and the “low” correla- 
tion between the Thurstone and Likert meth- 
ods of scoring the one scale that did, Ferguson 
believes that he has successfully demonstrated 
“that Likert’s technique for the construction of 
attitude scales does not obviate the need for a 
judging group” (7, p. 57). 

We cannot agree with this conclusion. What 
has been demonstrated, as we pointed out ear- 
lier, is that Likert-selected items do not neces- 
sarily fall at equally-spaced intervals along the 
theoretical continuum posited by Thurstone and 
Chave. That they do not may be of theoretical 
interest, but has little bearing upon the prac- 
tical problem of whether or not there is need 
for a judging group. This question can only be 
answered in terms of whether or not scales con- 
structed independently by each of the two 
methods will yield comparable scores, i.e., if an 
individual is a standard distance above the 
mean on one scale, he will be a comparable 
distance above the mean on the second. 

It might seem that the correlation of .70 be- 
tween the Thurstone and Likert method of 
scoring the economic conservatism scale would 
bear upon the problem. But this correlation is 
biased in that Ferguson failed to give the Thur- 
stone method a fair trial, ie, he limited the 
Thurstone scale to the items already selected 
by Likert's technique. Nor can we accept the 
correlations of .75 and .81 (corrected for atten- 
uation) which Murphy and Likert report for 
their Internationalism Scale and scores on the 
Thurstone-Droba War Scale. These correlations 
also fail to do justice to the question of whether 
comparable results can be obtained with inde- 
pendently constructed Thurstone and Likert 
scales since it is possible that the attitudes 
under consideration are not the same. 


COMPARATIVE STUDY OF THE TWO 
METHODS 


A valid comparison of the Thurstone and 
Likert techniques, we believe, must start with 
an original set of items, not with items already 
sifted by the Thurstone procedure and then 
scored by Likert's method, and not with items 
sifted by the Likert procedure and then scaled 
by the Thurstone technique. We believe also 


scored by the Likert method (reported by Rund- 
quist and Sletto as .85), and correcting for attenu- 
ation, the correlation would be .82. 
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that the same group of subjects should be used 
in the construction of the two scales, but that 
the steps for each method should be carried 
through independently. To carry out this com- 
parison, we used the original statements of 
opinion used by Thurstone and Chave in the 
construction of their scale designed to measure 
attitude toward the church. 

Subjects used in the construction of the scale 
were 72 members of an introductory psychology 
class at the University of Maryland.13 Half the 
class, selected at random, was asked to judge the 
degree of favorableness or unfavorableness ex- 
pressed by the statements in accordance with 
the Thurstone method, while the other half of 
the class was requested to give Likert-type re- 
sponses to the same statements. Two days later 
the procedure was reversed; the first half of the 
class gave Likert-type responses to the state- 
ments, while the other half gave Thurstone-type 
responses. The Seashore and Hevner (17) 
method of rating items was used instead of the 
Thurstone and Chave procedure of sorting 
items into piles.14 

In constructing the Thurstone scale, tabula- 
tions were made indicating the number of 
judges who placed each item in each of the 
categories. From these data accumulative pro- 
portions were determined and ogives constructed 
for each item. Scale values of the items were 
found by dropping a perpendicular to the base- 
line of scale values at the point where the curve 
crossed the 50 per cent level.!5 Q values were 
determined in a similar fashion by dropping 
perpendiculars at the 25th and 75th per cent 
levels, the Q value being the scale distance be- 
tween these two points. 

Items were selected for two “equivalent” forms 
of the scale, each form containing 20 items. Se- 
lection was made on the basis of Thurstone’s 


18 Although Thurstone and Chave used a much 
larger group of subjects, subsequent research (14) 
indicates that groups as small as 25 or 50 can be 
used to obtain scale values of items and that these 
values are very similar to those obtained with larger 
groups. 

14 Seashore and Hevner found that a technique of 
asking judges to rate statements on a scale instead 
of requesting that they sort the statements into 
piles yielded results which correlated very well 
with those obtained by Thurstone's original sort- 
ing procedure. See also the study by Ballin and 
Farnsworth (2). 

18 The correlation coefficient between our scale 
values and those obtained by Thurstone and Chave 
15 years earlier was 95. Our Q values, however, 
tended to differ considerably, the correlation co- 
efficient being only .18. 


informal standards, Q values, and scale values. 
Insofar as possible, the final scales contained 
items with low Q values and with scale values 
which were spread along the entire scale at rela- 
tively equally-spaced distances. Since only a few 
items, however, were found to have scale values 
near the center of the continuum and, at the 
same time, low Q values, this was not entirely 
possible. 

In constructing the Likert scale, a total score 
for each subject was found by summing the 
weights of responses given for each of the items. 
The upper and lower 10 subjects in terms of 
total scores served as the groups for applying 
the criterion of internal consistency. Since many 
of the original items would not meet Likert's 
a priori screening standards, we thought it pos- 
sible that total scores determined in part by 
these items would include in the criterion 
groups individuals who might otherwise not be 
represented, i.e., if these items had not been 
used in the scoring. Thurstone and Chave, we 
might emphasize, had included various ambigu- 
ous items in the original set in order to test Q 
as a means of statistically determining ambi- 
guity. Total scores, therefore, were first deter- 
mined by excluding those items which we felt 
did not meet Likert's criteria. Total scores were 
then found with these items included. Since we 
found that the criterion groups would contain 
essentially the same subjects using either score, 
the total scores based on all of the items were 
used. 

"Twenty-five items, all with a mean difference 
between the two criterion groups of 1.8 or 
higher, were selected for the final Likert scale. 
Approximately half of these items were weighted 
5 for a strongly agree response and half were 
weighted 1 for a strongly agree response. Of the 
25 items selected for the Likert scale, $ were also 
used in Form A of the Thurstone scale and 2 
were used in common with Form B. 


RELIABILITY AND COMPARABILITY 


To obtain data on the reliabilities of the 
scales and to find out the relationship existing 
between scores on the independently con- 
structed Likert and Thurstone scales, members 
of another introductory psychology class and an 
applied psychology class at the University of 
Maryland were tested. One group of subjects 
was presented with the Thurstone scales fol- 
lowed by the Likert scale; for the second group 
of subjects the order of presentation was re- 
versed. There were 80 subjects altogether, each 
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group containing approximately half of this 
number, 

The reliability coefficient for the Likert scale 
of 25 items was .94. This coefficient compares 
favorably with those usually reported for scales 
constructed by this method. The reliability 
coefficient for the equivalent forms of the Thur- 
stone scales of 20 items each was .88. This is 
comparable to the reliability coefficients of .85 
and .89 which Thurstone and Chave originally 
reported for scores on their Form A and B for 
two different groups of subjects (18, p. 66). 

The correlation coefficient between scores on 
the Likert scale and Form A of the Thurstone 
was .72, which, when corrected for attenuation, 
becomes .79. On the other hand, the correlation 
between the Likert scale and Form B of the 
Thurstone was .92. When corrected for attenu- 
ation the coefficient indicates a perfect relation- 
ship. Unfortunately, we have no way of 
knowing which of these two coefficients is more 
representative of the "true" relationship exist- 
ing between scores on independently constructed 
Likert and Thurstone scales in general. But the 
coefficient of .92 between the Likert and one 
of the Thurstone scales is surely sufficiently high 
to establish the fact that it is possible to con- 
struct scales by the two methods which will yield 
comparable scores. This is the question we set 
out to answer. 


SUMMARY AND CONCLUSIONS 


Now if we go back and examine the points 
on which we compared the Thurstone and 
Likert techniques of scale construction, we 
reach the following conclusions: 


l. The evidence available indicates that the 
attitude of the judging group is not an impor- 
tant factor determining the scale values of items 
sorted by the Thurstone technique.!^ 


16 We are not satisfied with the evidence on this 
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28. A Consideration of Beliefs, and Their Role in 


Attitude Measurement 


MARTIN FISHBEIN 


Two persons who are equally opposed to 
segregation may have quite different concep- 
tions of its nature, causes, and consequences, 
and may hold different views concerning the 
actions which should be taken to eliminate 
segregation. In the language of this paper, 
these two persons are said to have the same 
attitudes toward segregation but to hold dif- 
ferent beliefs about it. Attitudes are learned 
predispositions to respond to an object or class 
of objects in a favorable or unfavorable way. 
Beliefs, on the other hand, are hypotheses con- 
cerning the nature of these objects and the 
types of actions that should be taken with 
respect to them. 

Many writers do not maintain this distinc- 
tion between attitudes and beliefs. Both no- 
tions are commonly subsumed under the single 
term “attitude,” which is said to have affective 
(evaluative), cognitive, and conative (action) 
components. According to this view, an "atti- 
tude” toward segregation would include not 
only a person’s negative feelings toward segre- 
gation, but also his ideas about its causes and 
implications, and his conviction that it should 
be attacked through mass demonstrations and 
legislation. There is, of course, no overwhelm- 
ing reason why the word “attitude” should not 
mean all of these things. Words are created by 
man for his own convenience, and they may 
be used in any manner that man finds appro- 
priate, But certain considerations lead us to 
believe that “attitude” is a more useful scien- 
tific word when it is given restricted meaning. 


* Revised and expanded for this book from “A 
Consideration of Beliefs, Attitudes, and Their 
Relationships,” in I. D. Steiner and M. Fishbein 
(Eds), Current Studies in Social Psychology, Holt, 
Rinehart, and Winston, New York, 1965, with per- 
mission of the publisher. 


It is obvious that affect, cognition, and action 
are not always highly correlated. Different peo- 
ple like the same thing for different reasons, and 
a thin man who likes pastries may eat them be- 
cause he believes they contain many calories 
while a fat man may spurn them for exactly the 
same reason. Consequently, a multi-component 
conception of attitude turns out to be a multi- 
dimensional conception, and the “attitude” of 
any one person toward an object or concept may 
fall at three very different positions on three dif- 
ferent dimensions. But the operations by which 
attitudes are measured almost invariably yield 
a single score which is unlikely to reflect these 
three different components in any very precise 
fashion. As a matter of fact, people who con- 
struct "attitude scales" rarely maintain that 
their instruments are measuring three com- 
ponents; instead, they usually contend that 
their scales indicate people's evaluations (pro- 
con) of objects or concepts. Thus, although 
"attitudes" are often said to include all three 
components, it is usually only evaluation or 
"the affective component" which is measured 
and treated by researchers as the essence of 
attitude. 

It is the contention of this paper that in- 
creased precision and understanding can be 
gained by bringing our definition of attitude 
into closer harmony with the techniques by 
which attitudes are measured. Multi-dimen- 
sional concepts are notoriously difficult to em- 
ploy in rigorous theory, and they create almost 
unmanageable problems when theory is trans- 
lated into research. A conceptual system in 
which only the affective component is treated 
as attitudinal, and the other two components 
are linked to beliefs, should permit a more 
productive approach to the study of attitudes. 
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Elsewhere in this book (pp. 108 to 116) C. E. 
Osgood has described techniques for measuring 
the meanings which objects or concepts have 
for people. Subjects rate the object or concept 
on a series of seven-step bipolar scales, the ends 
of which are identified by adjectives such as 
"good-bad," "'clean-dirty," "strong-weak," or 
"fastslow." Ratings obtained on a very large 
number of these scales have been subjected to 
a procedure called factor analysis which reveals 
the underlying dimensions of meaning along 
which people see various objects as falling. It 
is significant that Osgood has invariably found 
that one of the most critical aspects of meaning 
is an evaluative dimension. A very large share 
of the meaning of objects or concepts appears 
to be determined by the position which the 
object or concept is seen to occupy on a good- 
bad or pro-con continuum. 

After reviewing many of the earlier defini- 
tions of attitude, Osgood and his associates (i.e., 
Osgood and Tannenbaum, 1955; Osgood, Suci, 
and Tannenbaum, 1957) concluded: 


It seems reasonable to identify attitude, as it 
is ordinarily conceived in both lay and scien- 
tific language, with the evaluative dimension 
of the total semantic space. .. . The meaning 
of a concept is its location in a space defined 
by some number of factors or dimensions, and 
attitude toward a concept is its projection onto 
one of these dimensions defined as “evaluative.” 

Osgood’s definition of attitude has many ad- 
vantages. First, it has the virtue of treating 
attitude as a unidimensional concept. It equates 
attitude with the evaluative meaning of an 
object or concept, i.e. attitude refers only to 
the "evaluation" of a concept—its "favorable- 
ness” or “unfavorableness,” its "goodness" or 
“badness.” Other types of meaning which the 
object may have for a person (e.g, its size, 
shape, speed, etc.) are excluded from the no- 
tion of attitude except insofar as they may in- 
fluence the placement which the person gives 
the object on the evaluative dimension. Second, 
as Osgood et al. (1957) point out, "every point 
in semantic space has an evaluative component 
(even though the component may be of zero 
magnitude when the evaluative judgments are 
neutral)." Thus, this definition makes it clear 
that with respect to any object or concept, an 
individual has a positive, negative, or neutral 
attitude. Third, it is consistent with the descrip- 
tion of attitude as a learned predisposition to 


respond to a stimulus in a favorable or un- 
favorable way. According to Osgood, this defi- 
nition characterizes attitude as a mediating 
evaluative response, i.e., a learned implicit re- 
sponse that varies in intensity and tends to 
"mediate" or guide the individual's more overt 
evaluative responses to an object or concept. 

Considerable evidence supporting the relia- 
bility and validity of Osgood's instrument for 
measuring attitudes (called the Semantic Dif- 
ferential) has been presented by Osgood et al. 
(1957) and Fishbein and Raven (1962). Although 
the Semantic Differential is not the only tech- 
nique by which unidimensional attitude scores 
may be obtained, it provides a clear operational 
definition of the term "attitude" as it will be 
used in this paper. 


Il. BELIEF 


In another paper in this book, Fishbein and 
Raven (pp. 183 to 189) suggested a definition 
of belief that is analogous to the definition of 
attitude. According to these investigators, an 
individual may not only evaluate a concept 
(Le, view it as "good" or "bad," "clean" or 
“dirty,” etc) but also he may believe or dis- 
believe in the existence of the concept (i.e, 
view it as "existent" or “non-existent,” "prob- 
able" or "improbable," etc). As we have seen 
above, the first type of judgment has been 
viewed as a measure of the evaluative dimen- 
sion of a concept, or more specifically, as an 
attitude. The latter type of judgment may be 
viewed as a measure of the probability dimen- 
sion of a concept, or more specifically, as a 
"belief." Just as Osgood and his associates dem- 
onstrated that valid and reliable measures of 
attitude could be obtained by having the sub- 
ject judge the concept on a series of bipolar 
evaluative scales, Fishbein and Raven demon- 
strated that valid and reliable measures of be- 
lief could be obtained by having the subject 
judge the concept on a series of bipolar proba- 
bilistic scales (e.g., probable-improbable, likely- 
unlikely, possible-impossible, etc). It is this 
definition of belief, ie, the position of the 
object or concept on the probability dimension, 
that will be used throughout this paper. 

At first glance, however, it may appear that 
this conception of belief is a highly specialized 
one. That is, this definition of belief appears 
to be concerned only with the probability of 
the existence of an object—it does not appear 
to deal with the type of belief that most psy- 
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chologists have considered. Particularly in the 
area of attitude research, almost all psycholo- 
gists have been concerned with beliefs about 
the object. As was pointed out above, both the 
“cognitive” and “action” components of atti- 
tude can be viewed as beliefs about the object. 
The cognitive component refers to beliefs 
about the nature of the object and its relations 
to other “objects,” while the action component 
refers to beliefs about what should be done 
with respect to the object. 

Fishbein and Raven recognized this problem 
and suggested a distinction between belief in 
an object and belief about an object. “Thus 
far, we have defined belief in, or more com- 
pletely, belief in the existence of, an object. 
One could also consider belief in the existence 
of a relationship between that object and some 
other object or some quality.” Thus, just as 
an individual may believe or disbelieve in the 
existence of "God," so too might he believe 
or disbelieve in the existence of relationships 
involving "God," eg. “God is omnipotent,” 
"God is omniscient,” "God is vengeful,” “God 
had a son,” etc. Any of these statements could 
be placed above the probability scales, and the 
belief in the existence of that particular rela- 
tionship could be measured. “The various be- 
liefs in the relationships between an object and 
other objects or qualities would then be de- 
fined as beliefs about that object. While belief 
in refers to the existence of an object, belief 
about deals with the nature of that object, the 
manner in which it exists.” 

Thus, like beliefs in, beliefs about an object 
are also equated with the probability dimension 
of a concept. However, in the latter case, the 
“concept” is a relational statement, i.e. a state- 
ment associating the object of belief with some 
other object, concept, value, or goal. In gen- 
eral, then, a belief about an object may be 
defined as the probability or improbability that 
a particular relationship exists between the 
object of belief and some other object, concept, 
value, or goal. 

This definition of beliefs about an object is 
consistent with the descriptions of beliefs that 
most investigators have suggested. That is, most 
investigators have reached implicit or explicit 
agreement that a belief about an object may 
be described as a relationship between the ob- 
ject (of belief) and some other object, value, 
goal, or “concept.” Indeed, most investigators 
are agreed that any belief about an object may 
be diagrammed as (X) ———— (Y), where (X) 


refers to the object of belief, (Y) refers to some 
other object or concept, and the line, i.e., 
represents the relationship or asser- 
tion linking (X) and (Y). Thus, a belief about 
an object can, in part, be described as any 
statement that relates the object of belief to 
some other object, value, concept, or goal. 
This description, however, is really a definition 
of a belief statement; the belief per se is the 
position that an individual ascribes to the state- 
ment on the probability dimension, i.e., the 
probability or improbability that the particular 
relationship expressed in the statement does 
exist. 

This conception of beliefs about objects ap- 
propriately represents the many different types 
of beliefs that an individual may hold. For 
example, consider the following different types 
of beliefs that have been referred to in the 
literature: 


l. Beliefs about the component parts of the 
object. 

2. Beliefs about the characteristics, qualities, 
or attributes of the object. 

3. Beliefs about the object's relation with 
other objects or concepts. 

4. Beliefs about whether the object will lead 
to or block the attainment of various goals or 
"valued states." 

5. Beliefs about what should be done with 
respect to the object. 

6. Beliefs about what the object should, or 
should not, be allowed to do. 


With respect to the “object” Negro, these six 
types of belief may be illustrated as follows: 


l. (Negroes) have (dark skin). 

2. (Negroes) are (athletic). 

3. (Negroes) are equal to (white men). 

4. (Negroes) inhibit (free expression of ideas). 

5. (Negroes) should be (respected). 

6. (Negroes) should not be allowed to hold 
(government jobs). 


It should be noted that these six types of 
beliefs are the kinds of beliefs that have typi- 
cally been considered as comprising the “cog- 
nitive” and "action" components of attitude. 
That is, when most investigators are talking 
about an individual's "cognitive structure" or 
the "cognitive component,” they are referring 
to beliefs to Type 1, 2, 3, or 4. Similarly, when 
they are talking about the "action component," 
they are usually referring to beliefs of Type 5 
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or 6.1 In the present paper, however, rather 
than viewing “belief” as a part of “attitude,” 
these concepts have been described and defined 
independently. "Attitude" has been charac- 
terized as a learned, implicit response that 
mediates evaluative behavior, and has been 
operationally defined as "a concept's position 
on the evaluative dimension." Similarly, “be- 
lief" was operationally defined as "a concept's 
position on the probability dimension," and a 
distinction between beliefs in and beliefs about 
an object has been made. Specifically, belief in 
refers to the probability that a particular ob- 
ject or concept exists, and belief about refers 
to the probability that a particular relation- 
ship involving the object or concept exists. 
Thus, like attitude, belief is defined as unidi- 
mensional concept, i.e., the term “belief” refers 
only to the "probability" or “improbability 
that a particular object (belief in) or a particu- 
lar relationship (belief about) exists. In addi- 
tion, we have seen that beliefs about objects 
may be described as relationships between the 
object of belief and any other object, concept, 
value, or goal. In the following section, we 
shall consider some of the implications of this 
definition and description of beliefs about an 
object. 


HL BELIEFS ABOUT OBJECTS 


In the discussion of attitude, it was pointed 
out that every concept contains an evaluative 
component. That is, we have attitudes toward 
all concepts or objects. Thus, in considering 
a belief statement, an individual not only has 
an attitude toward the object of belief, but 
he also has an attitude toward the "related 
object." For example, with respect to the state- 
ment that “Negroes are athletic," an individual 
has an attitude toward "athletic" as well as 
toward "Negroes." Similarly, in the statement, 
"Negroes should not be allowed to hold gov- 


1 It should be noted, however, that in addition to 
beliefs about the attitude object, the "action com- 
ponent" has also been viewed in terms of beliefs 
about "the self.” That is, subjects are often asked 
to indicate whether or not they would engage in 
certain activities with the attitude object (see 
Triandis' contribution to this book, pp. 208 to 219). 
For example, a subject might be asked to indicate 
whether hé “would” or “would not" “res 
Negroes.” Diagrammatically, this type of belief may 
be represented as follows: 

(I) would (respect Negroes). 
a would not (allow Negroes to hold government 
jobs) 


ernment jobs,” an individual has an attitude 
toward “government jobs.” The attitude associ- 
ated with the related concept shall henceforth 
be referred to as the evaluative aspect of a 
belief about an object. Thus, every belief about 
an object contains a positive, negative, or neu- 
tral evaluative aspect. In other words, every 
belief statement relates the object of belief to 
some other concept that is positively, nega- 
tively, or neutrally evaluated by an individual. 

Further, the relation or assertion between 
the object of belief and the related object may 
be either positive or negative, "associative" or 
“dissociative.” “Examples of associative (posi- 
tive) relations are: is, has, includes, likes, 
produces, implies. Examples of dissociative 
(negative) relationships are: avoids, hates, hin- 
ders, defeats, destroys, is incompatible with.” 
(Abelson, 1959, p. 345). 

Thus, there are six basic types of belief 
statements: 


1. The attitude object is positively associated 
with a positively evaluated concept. 

2. The attitude object is positively associated 
with a neutrally evaluated concept. 

3. The attitude object is positively associated 
with a negatively evaluated concept. 

4. The attitude object is negatively associ- 
ated with a positively evaluated concept. 

5. The attitude object is negatively associ- 
ated with a neutrally evaluated concept. 

6. The attitude object is negatively associ- 
ated with a negatively evaluated concept. 


Each of these statements implies a favorable, 
neutral, or unfavorable attitude toward the 
attitude object. That is, as Thurstone (1928) 
and others have pointed out, beliefs (or 

"opinions") may be viewed as expressions of 
attitude; beliefs are often used “as the means 
for measuring attitude.” Specifically, statements 
of Type 1 and Type 6 indicate favorableness 
toward the attitude object. Thus, if an indi- 
vidual holds beliefs of Type 1 or Type 6, this 
would suggest that he has a positive attitude 
toward the object. Similarly, statements of Type 
2 and Type 5 imply neutral evaluations of the 
attitude object, while statements of Type 3 and 
Type 4 indicate negative evaluations. The more 
favorable or unfavorable the evaluative aspect 
of the belief statement, the more favorable or 
unfavorable is the attitude indicated. It is im- 
portant to note, however, that if one is to judge 
whether a given statement suggests a favorable 
or unfavorable attitude, it is necessary to know 
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how the person making the statement (i.e., 
holding the belief) evaluates the related concept. 
For example, most people would agree that if 
an individual believed that "Negroes are equal 
to white men," this would suggest that he had 
a positive attitude toward Negroes. This judg- 
ment, however, is based on an implicit assump- 
tion that all individuals evaluate "white men" 
positively. Yet, it is possible to argue that for 
some individuals, holding this belief, i.e., agree- 
ing with this statement, actually indicates a 
negative attitude toward Negroes. Consider 
the case of an Oriental respondent who has a 
strong negative attitude toward “white men.” 
If this person believed that “Negroes are equal 
to white men,” the belief would suggest that 
he had a negative attitude toward Negroes. 
That is, from his point of view, he is agreeing 
with a statement that associates Negroes with 
a negatively evaluated concept (ie. a Type 3 
statement). Thus, although two people may 
hold the same belief, this belief may mean 
quite different things to the people; and there- 
fore, it may suggest that they hold quite dif- 
ferent attitudes. 

It is for this reason that careful attention is 
paid to item selection when one is attempting 
to construct an attitude scale. An attempt is 
made to select only those items that will have 
the same meaning for all respondents. For 
example, Thurstone Scales are usually con- 
structed through the use of the methods of 
successive categories and equal-appearing inter- 
vals. That is, the experimenter selects a large 
number of statements involving the attitude 
object. These statements are then presented to 
judges who are asked to sort the statements 
into eleven categories ranging from “l,” the 
statement indicates extreme unfavorableness 
toward the attitude object, to "6," the state- 
ment indicates neither favorableness nor un- 
favorableness toward the attitude object, to 
“11,” the statement indicates extreme favorable- 
ness toward the attitude object. Essentially 
then, the judge is presented with a large num- 
ber of belief statements; and he is asked to 
judge whether each of these statements indi- 
cates an unfavorable, favorable, or neutral atti- 
tude towards the attitude object. Since the 
major determinants of the degree of favorable- 
ness or unfavorableness indicated by a state- 
ment are (1) the nature of the association 
(positive or negative) and (2) its evaluative 
aspect (favorable, neutral, or unfavorable), and 
since the nature of the association is specified, 


one of the major variables determining where 
a judge will place a statement is his evaluation 
of the related concept. Only those statements 
that result in a consistent sorting by the judges 
are retained. Wherever there is much disagree- 
ment among the judges' ratings, the item is 
eliminated. Each statement that is retained is 
given a value equal to the mean or median 
of the category placements it has received from 
the judges. The experimenter then selects a 
final set of items that range all along the 
favorable-unfavorable continuum, ie. items 
that link the attitude object with other objects 
which represent varying positions on the evalu- 
ative dimension. 

It is important to note, however, that the 
discarded items are mot eliminated because 
they don't indicate favorableness or unfavor- 
ableness toward the attitude object. Indeed, it 
is because different judges see them as indicat- 
ing different attitudes that they are eliminated. 
"Thus, discarded items are not ordinarily irrele- 
vant to the attitude which one wishes his in- 
strument to measure. But items which have 
different attitudinal significance for different 
judges are likely to have uncertain attitudinal 
significance for other people as well. For ex- 
ample, a statement such as “Negroes are happy- 
go-lucky,” is likely to be eliminated from the 
sample. This is because some judges will evalu- 
ate being "happy-go-lucky" as good, others will 
evaluate it negatively, and still others will con- 
sider it neutral. Because the statement has dif- 
ferent evaluative aspects for different judges, 
it will be placed in quite different categories. 
On the other hand, a statement such as 
"Negroes are intelligent" is likely to be re- 
tained. That is, almost all judges will be in 
agreement that being "intelligent" is good; 
and thus, almost all judges will view the state- 
ment as indicating a favorable attitude. If one 
knew how a given individual evaluated being 
“happy-go-lucky,” his agreement or disagree- 
ment with the statement “Negroes are happy- 
go-lucky” would be just as good an indication 
of his attitude as would his agreement or dis- 
agreement with the statement “Negroes are 
intelligent.” It should be noted, however, that 
no one belief is likely to be a very reliable or 
valid “indicator” of attitude. As we shall see 
below, it is only when many beliefs are con- 
sidered collectively that reliable estimates of 
attitude can be obtained. 

Although the above discussion pointed out 
that the items eliminated on the basis of the 
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judges’ decisions are not irrelevant to an indi- 
vidual’s attitude, Thurstone did establish a 
“criterion of irrelevance.” Unlike the technique 
described above, this criterion, rather than be- 
ing based on judgments of favorableness and 
unfavorableness, is based on subjects’ agree- 
ment or disagreement with items that have 
already received scale values. That is, this cri- 
terion is only considered after a scale has 
been constructed through the method described 
above. Essentially, this criterion leads to the 
elimination of items that are responded to in 
the same manner (i.e., agreed with or disagreed 
with) by subjects with very different attitudes. 
More specifically, it should be recalled that 
each item selected for inclusion on a Thurstone 
Scale presumably represents a given point on 
an attitude continuum (ranging from favorable- 
ness to unfavorableness). In addition, each sub- 
ject may also be viewed as falling on some 
point along that continuum. In order for an 
item to meet the criterion of relevance, the 
greatest amount of agreement with the item 
should come from subjects whose attitudes fall 
on the same point of the continuum as does 
the item. The bigger the discrepancy between 
the subject's attitude (ie, his scale position) 
and the attitude expressed by the item (i.e., the 
item's scale position), the smaller should be 
the probability that the subject will agree with 
the item. Thus, for example, if subjects at 
every point on the continuum were equally 
likely to agree with a given item, the item 
would be eliminated as "irrelevant." Similarly, 
if subjects at two different points of the con- 
tinuum (e.g., subjects with quite favorable and 
quite unfavorable attitudes) had higher proba- 
bilities of agreeing with an item than subjects 
at any other point, the item would be elimi- 
nated. Although little emphasis has been given 
to this criterion in Thurstone Scaling, it is 
essentially equivalent to the major criterion 
used in selecting items for a Likert Scale. Thus, 
let us briefly consider the construction of a 
Likert Scale. 

As in Thurstone Scaling, the experimenter 
starts with a large set of items that relate the 
attitude object to other objects, concepts, etc. 
(ie. belief statements). Rather than giving 
these items to judges, the experimenter simply 
decides which items indicate favorableness and 
which items indicate unfavorableness to the 
attitude object. If a statement indicates nei- 
ther favorableness or unfavorableness to the 
experimenter, or if he has difficulty in deter- 


mining whether the statement is favorable or 
unfavorable, it will usually be eliminated im- 
mediately. Having made these decisions, the 
experimenter then gives the set of remaining 
items to a group of subjects, who are asked to 
indicate their degree of agreement or disagree- 
ment with each statement (usually by respond- 
ing to each item on a five-point scale ranging 
from "strongly agree" to "neither agree nor 
disagree" to "strongly disagree"). For statements 
judged as indicating favorableness toward the 
attitude object, "strong agreement" is given 
the value 5 and "strong disagreement" is given 
the value 1. The order of scoring is reversed 
for statements indicating unfavorableness to- 
ward the attitude object, i.c, "strong disagree- 
ment" is given the value 5. A total score is 
then obtained for each subject by summing his 
scores on each of the individual items. It is 
at this point that item selection actually begins. 
While many methods of item analysis have 
been used as a basis for rejecting statements, 
the most common (and probably the simplest) 
technique involves selecting the 25 per cent of 
the subjects with the highest total scores and 
also the 25 per cent of the subjects with the low- 
est total scores. It is assumed that these two 
groups provide criterion groups in terms of 
which the individual items can be analyzed. 
"That is, it is assumed that the subjects in these 
two groups have maximally different attitudes 
toward the object; and thus, if the item is a 
good one, it should discriminate between the 
two groups. More specifically, the mean score 
on a given statement for the high group should 
be significantly different than the mean score 
on the same statement for the low group. If 
the mean scores are not significantly different, 
the item is eliminated. Thus, similar to Thur- 
stone's criterion of irrelevance, if an item is 
responded to in the same way by subjects with 
different attitudes, the item is considered as a 
"bad" or "irrelevant" item, and is eliminated 
from the instrument. 

After running significance tests on each item, 
the experimenter then constructs his final in- 
strument by selecting the 20 (or 25) items that 
were the best discriminators (i.e., produced the 
largest significant differences between the 
groups). In addition, he attempts to include 
an equal number of statements that indicate 
favorableness and unfavorableness so that pos 
sible response sets of the subjects will be 
minimized. 

Here again, however, it should be noted that 
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the items eliminated from Likert Scales on the 
basis of item analysis or from Thurstone Scales 
on the basis of the criterion of irrelevance are 
not necessarily "bad" or "irrelevant" attitude 
items. They are simply being eliminated be- 
cause people with different attitudes tend to 
show the same amount of agreement with them. 
That is, the assumption is being made that if 
two people have different attitudes, but hold 
the same belief, then, "by definition," the be- 
lief is viewed as irrelevant and unrelated to 
their attitudes. As we saw above, however, this 
is a completely false assumption since the same 
belief may have totally different attitudinal 
significance for different individuals. In other 
words (similar to items eliminated by judges 
in Thurstone Scaling), the item is not being 
eliminated because it has no attitudinal sig- 
nificance (i.c, because it is unrelated to atti- 
tude), but rather it is eliminated because it 
does not indicate the same attitude for all 
individuals. For example, suppose an experi- 
menter was constructing a Likert scale to mea- 
sure attitudes toward a political candidate. 
One of the items that comes to his attention 
is the item "Candidate X is opposed to atmo- 
spheric nuclear testing." Being violently op- 
posed to atmospheric nuclear testing himself, 
he (i.e, the experimenter) views the statement 
as one that is favorable toward “Candidate X,” 
and thus leaves it in the set he will initially 
present to his subjects. Further, let us assume 
that Candidate X really is in favor of ending 
atmospheric nuclear testing, and has made his 
position clear to the public. Thus, all subjects 
will tend to agree strongly with this item. 
Since it does not discriminate between indi- 
viduals with favorable and unfavorable atti- 
tudes toward Candidate X, the item will be 
eliminated. But this does not mean that the 
belief has no attitudinal significance. On the 
contrary, the belief may be strongly related to 
the individual's attitude; and it can serve as 
a good indicant of it. That is, many subjects, 
like the experimenter, will view “atmospheric 
nuclear testing” as a very bad thing; and thus 
their agreement with the statement “Candidate 
X is opposed to atmospheric nuclear testing” 
indicates that they have a favorable attitude 
toward Candidate X. (Indeed, holding this be- 
lief may actually be one of the reasons they 
are favorable to Candidate X.*) Many other 


2 For a discussion of beliefs as determinants of 
attitude, see Rosenberg (pp. 325 to 331 in this 
book) and Fishbein (pp. 389 to 400) in this book. 


subjects, however, may see “atmospheric nu- 
clear testing” as a very good thing. Thus, their 
agreement with the statement indicates an 
unfavorable attitude toward Candidate X. 
(Here, too, holding this belief may be a reason 
for being unfavorable.) Clearly then, if one 
knew the way a given individual evaluated 
“atmospheric nuclear testing,” his belief that 
"Candidate X is‘opposed to atmospheric nu- 
clear testing" would be as good an indicant of 
his attitude toward Candidate X as would any 
belief statement that met the criterion for in- 
clusion on the Likert Scale. Although no one 
belief is likely to be a very reliable or valid 
indicator of attitude, all beliefs about an ob- 
ject may be viewed as indicants of attitude 
toward that object. It is unfortunate, however, 
that the necessity of obtaining items that have 
the same attitudinal meaning for all individ- 
uals often leads to the elimination of precisely 
those items that are probably most strongly 
related to attitude, and thus would serve as 
the best indicants of attitude. That is, as in 
the example given above, it will often be the 
case that many individuals will hold the same 
beliefs about any given object. For example, 
during the 1964 Presidential election, most of 
the population of the United States held at 
least the following six beliefs about Lyndon 
Johnson: 


1. He is a Democrat. 

2. He is in favor of the anti-poverty bill. 

3. He is in favor of Medicare. 

4. He is in favor of our present foreign pol- 
icy in Viet Nam. 

5. He is in favor of increased social security 
benefits. 

6. He is in favor of the Nuclear Test Ban 
Treaty. 


Although most people held these same beliefs, 
the evaluative aspects of these beliefs differed 
considerably among the people (ie, some 
would evaluate “being a Democrat” positively 
and some negatively; some would evaluate one 
of the issues—Medicare, increased social secu- 
rity benefits, our present foreign policy in Viet 
Nam—positively, others would evaluate the 
same issue negatively). Thus, although most 
people held the same beliefs, these beliefs were 
quite likely to lead to different attitudes? Yet, 
none of these statements would appear on 
either a Thurstone or a Likert Scale because 
they do have different attitudinal significance 


3 Ibid. 
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for different people. We shall return to this 
point below. 

Consideration of the types of statements that 
are retained or eliminated from attitude scales 
also points out another important aspect of 
attitude research. Many investigators have at- 
tempted to distinguish between belief and atti- 
tude on the basis of “affect.” Krech and 
Crutchfield (1948), for example, have stated 
that “beliefs, as such, are motivationally and 
emotionally neutral . . . attitude can be de- 
fined as either ‘pro’ or ‘anti’ while beliefs are 
conceived as neutral.” Similarly, Katz and Stot- 
land (1959) have asserted that a belief is not 
an attitude “unless there is an attribution of 
good or bad qualities accompanying the spe- 
cific belief.” As was pointed out above, how- 
ever, all belief statements may be viewed as 
indicating some degree of favorableness or un- 
favorableness toward the attitude object. Fur- 
ther, a statement that may contain an implicit 
or explicit neutral evaluation of the attitude 
object for one person may contain an implicit 
or explicit positive or negative evaluation for 
another person. 

In addition, the attempt to distinguish be- 
tween belief and attitude on the basis of 
“affect” has resulted in much confusion when 
one attempts to distinguish between a belief 
and an attitude. For example, consider the 
following statements: 


l. (Negroes) are (good). 

2. (Negroes) are (dirty). 

3. (Negroes) are (musical). 

4. (Negroes) are (superstitious). 
- (Negroes) are (tall). 

6. (Negroes) are (dark-skinned). 


ex 


Many investigators would say that statement 
"I" and probably statement "2" express an 
"attitude," Similarly, there would be general 
agreement that statements "5" and “6” express 
“beliefs.” With respect to statements “3” and 
"4" however, little agreement would be 
reached. Some investigators would say these 
statements reveal "beliefs" while others would 
associate them with "attitudes." 

From the point of view presented in this 
paper, all of the statements are belief state- 
ments—all of the statements consist of an asso- 
ciative relationship between "Negro" and some 
other "concept." Further, there is an attitude, 
an evaluative response, associated with each 
of these related concepts; each of these state- 
ments contain an evaluative aspect. As Green 
(1954) has pointed out, the concept of “atti- 


tude” is a hypothetical variable, abstracted 
from the many statements and actions that an 
individual makes with respect to a given object. 
Thus, none of the above statements may be 
viewed as an attitude. All, however, serve as 
indicators of attitude. If an individual posi- 
tively evaluates a “concept,” and if he believes 
that there is an associative relationship between 
“Negro” and that “concept,” his holding of 
this belief suggests that he has a positive atti- 
tude toward Negroes. Similarly, if an individual 
negatively evaluates a “concept,” his belief that 
there is an associative relationship between 
“Negro” and that “concept” suggests he has a 
negative attitude toward Negroes. Although 
each belief suggests an attitude, the attitude 
per se can only be reliably abstracted by con- 
sidering the many beliefs an individual holds. 

The problem is that most investigators have 
overlooked the fact that, for a given individual, 
all concepts have an evaluative component. 
Generally speaking, they have tended to label 
agreement with a statement or the statement 
per se as an "attitude" when the concept re- 
lated to the attitude object appeared to be 
universally favored or unfavored. Where it 
could not be assumed that all individuals would 
evaluate the concept related to the attitude 
object in the same manner, most investigators 
have tended to label the statement as a “belief.” 
The statements, however, are neither belief 
nor attitudes. They are all belief statements; 
the belief is the position an individual ascribes 
to the statement on the probability dimension. 
All belief statements do contain an evaluative 
aspect; all beliefs about an object carry some 
implicit or explicit evaluation of the attitude 
object. Thus, it would seem that the attempt 
to distinguish between “attitudinal” and "non- 
attitudinal" beliefs on the basis of "affect" is 
inappropriate. 

One further point concerning beliefs about 
objects should be considered. In this section, 
we have seen that a Thurstone Scale consists 
of a series of carefully selected belief state- 
ments. Each of the statements has previously 
been assigned some evaluative weight between 
"]" and "11". Further, this evaluative weight 
is primarily determined by the judges' attitudes 
toward the concepts related to the attitude 
object Thus, the respondent is confronted 
with a series of belief statements with different 
evaluative aspects. The respondent's task is 
simply to indicate by a check mark those state- 
ments "which come closest to his own feelings." 
His attitude is then considered to be indexed 
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by the mean or median value of the statements 
with which he agrees. Thus, the single score 
that represents the respondent's degree of fa- 
vorableness or unfavorableness toward the atti- 
tude object is obtained through a consideration 
of some of his beliefs about the object. 

Likert Scales also consist of a series of care- 
fully selected belief statements. However, rather 
than differentially weighting each of the state- 
ments, each statement is simply considered as 
indicating favorableness or unfavorableness 
toward the attitude object. Instead of checking 
only those statements with which he fully agrees, 
the respondent is asked to indicate how 
strongly he agrees with (i.e. believes) or disa- 
grees with (ie. disbelieves) each statement, 
usually by checking a five-step scale ranging 
from "strong agreement" to "strong disagree- 
ment." Each response is then given a score 
from 1 to 5, and the sum of the values is taken 
as the index of the respondent's attitude. The 
higher the sum, the more favorable the atti- 
tude. Thus, once again, it can be seen that the 
single score that represents the respondent's 
attitude is obtained through a consideration 
of some of his beliefs about the object. 

Similar descriptions could be made for most 
other types of "attitude scales,” e.g, Guttman 
Scales, Bogardus Scales, etc. Indeed, almost 
every attitude measurement instrument ob- 
tains its attitude index through a consideration 
of beliefs and their evaluative aspects. How- 
ever, although there is this basic similarity 
among most attitude measuring instruments, 
there are also several important differences. 
For example, Thurstone and Likert Scales 
place different emphasis on the relative im- 
portance of belief strength and the evaluative 
aspects of belief in obtaining attitude scores. 
That is, in Thurstone's system, an individual 
either agrees with (“1”) or does not agree with 
("0") each statement, and the evaluation asso- 
ciated with the statements varies from “1” 
(extremely unfavorable) to “11” (extremely 
favorable) In Likert's system, the degree of 
agreement varies from “l” to “5,” and each 
statement is simply considered as indicat- 
ing favorableness (“+1”) or unfavorableness 
("—1"). Further, it should be recalled that in 
Thurstone Scaling, the attitude score is ob- 
tained by computing the mean or median scale 
value of the sratements the respondent agreed 
with, while in Likert Scaling, the attitude score 
is obtained by summing the subject's responses 
to every item. 

Thus, in a sense, when one uses a Thurstone 


or Likert Scale, he is implicitly accepting fun- 
damentally different theoretical conceptions of 
attitude. More specifically, from the point of 
view of Thurstone Scaling, attitudes are pri- 
marily determined by the direction and inten- 
sity of the evaluation associated with an 
individual's beliefs about the attitude object. 
Neither the strength of these beliefs nor any 
"disbeliefs" are viewed as determinants of atti- 
tude. Further, the process of attitude organi- 
zation is viewed as a process of cognitive 
averaging or cognitive balance. 

In contrast to this, from the point of view 
of Likert Scaling, attitudes are primarily deter- 
mined by the strength of an individual's be- 
liefs and disbeliefs about the attitude object. 
Although the direction of the evaluation asso- 
ciated with each belief does serve as a deter- 
minant of attitude, the intensity of the 
evaluation does not. Finally, the process of 
attitude organization is viewed as a process of 
cognitive summation. 

Although the above discussion has pointed 
out some major differences between Thurstone 
and Likert Scaling techniques, their essential 
similarity should not be overlooked. These 
scales, like most other attitude measurement 
instruments, obtain an attitude score through 
a consideration of a respondent's beliefs about 
the attitude object. Although the different mea- 
surement techniques may differ with respect to 
the type of items they contain (e.g, a Likert 
Scale will not include an item with a neutral 
evaluative aspect while a Thurstone Scale will), 
and the relative weights that are given to belief 
strength and the evaluative aspects of beliefs, 
the essential point is that they are in complete 
agreement about the basic elements that must 
be taken into account (i.e. the strength and 
evaluative aspects of beliefs) in order to mea- 
sure attitude. However, as was noted above, 
many important belief statements will not ap- 
pear on most standardized instruments because 
they do have different meanings for different 
people. Thus, most measures of attitude are 
based on a very restricted range of belief state- 
ments. It would seem then, that the best esti- 
mate (or measure) of attitude would be one 
which (a) did not restrict the range of belief 
statements, and which (b) put equal weight on 
belief strength and the evaluative aspect of be- 
lief. Although this will necessitate obtaining 


- independent measures of the evaluative aspects 


of beliefs, the increase in work should be more 
than compensated for by increased precision in 
attitude measurement. 
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29. Intensity and a Zero Point for Attitude Analysis 


LOUIS GUTTMAN and EDWARD A. SUCHMAN 


l. THE NEED FOR A ZERO POINT 


In common parlance, people are said to be 
happy or unhappy, to be intelligent or stupid, 
to have a favorable or unfavorable attitude to- 
ward something, to be for or against a certain 
political candidate. Such dichotomizations are 
made with respect to characteristics that are 
recognized not to consist simply of two catego- 
ries; in common parlance, degrees of happiness, 
intelligence, or favorableness are considered to 
exist. The problem that we wish to consider in 
this paper is: Is there any sense to dividing 
people into two kinds—positive and negative— 
when the variable under consideration com- 
prises many differences in degrees? 

This problem occurs in much of attitude 
analysis and especially in public opinion anal- 
ysis. Public opinion polls attempt to state, for 
example, how many people are “pro-Russia” 
and how many are "anti-Russia," how many are 
“pro-labor” and how many are “anti-labor,” 
how many are "in favor of" certain govern- 
mental policies and how many are "against" 
these governmental policies, etc., etc. The prob- 
lem of how to determine such dichotomizations 
is not in general susceptible to solution by the 
use of an external criterion like going to the 
polls and voting, or contributing money, or 
some consideration. Such external criteria are 
important and of great interest, but seem to 
afford only an indirect approach to the prob- 
lem of defining a cutting point which is intrin- 
sic to the attitude or opinion. It is conceivable 
that a person is "against" a certain candidate 
but will vote for him because he is even more 
"against" the opposing candidate, or he is 
"against" a given proposition but will endorse 
it. if the only alternative is one he considers 
even worse; or he may be "for" a candidate but 
even more so "for" the opponent, or "for" a 
* Reprinted from Americam Sociological Review, 
1947, 12, 55-67, with permission of the senior 
author and the American Sociological Association. 


proposition but even more so "for" an alterna- 
tive one. Furthermore, different external cri- 
teria will produce different cutting points. For 
example, should a man be considered "pro- 
labor" or "anti-labor," if he contributes money 
to a labor welfare fund, but still crosses a 
picket line? 

It seems, then, that an intrinsically mean- 
ingful cutting point should be defined in a 
manner that will not depend on an external 
criterion. If such an internal dichotomization 
can be made, then in addition to solving some 
current methodological problems, it will open 
up new avenues of research in attitude and 
public opinion analysis which should increase 
the understanding of the depth or intensity of 
feeling with which opinions are held, and 
should also facilitate predictions. For example, 
of all people who are favorable to a candidate, 
how many will turn out to vote for him? Of all 
people who are well-adjusted now in their mar- 
riage, how many will become divorced in the 
future? 

The need for an internal definition of a zero 
point is especially vital for public opinion polls. 
Research workers in this field are acutely aware 
of the problem of "bias" in the wording of 
questions, and thus far there has been no satis- 
factory solution to this problem. It is well- 
known that the wording of questions, the order 
of presentation of questions, the wording of 
the checklist of answers, or the use of free 
responses in answers, and a whole host of re- 
lated things, can change the apparent opinion 
of the respondent. Furthermore, any single 
question asked on an issue is but one example 
of all possible questions that could be asked on 
that same issue, and the proportion saying 
"agree" or "yes" to these questions can range 
from zero to 100 per cent. For example, a 


1 Compare the "tree" of public opinion in Hadley 
Cantril, ed., Gauging Public Opinion, Princeton 
University Press, 1944, pp. 26-27. 
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study of soldiers’ attitudes toward the British 
asked, “Do you agree or disagree with the fol- 
lowing statements about the British? . . . 


... The British are doing as'good a job as 
possible of fighting the war, everything con- 
sidered. 

... The British always try to get other coun- 
tries to do their fighting for them." 


We find 80 per cent of the soldiers agreeing to 
the first statement, but only 48 per cent dis- 
agreeing to the second statement (Table 1). 


TABLE 1 
First Second 
Response Statement Statement 
Agree 80 47 
Disagree 17 48 
Undecided 3 5 
"Total 100 100 


Which of these questions is an "unbiased" 
question with respect to the entire issue? 

The problem of “bias,” or differing results 
produced by differing questions, has been widely 
recognized by pollsters; indeed, the utility of 
most of their findings rests almost entirely on 
reaching a solution to this problem. 

An approach to an internal definition of a 
zero point has been made by the Research 
Branch of the Information and Education Di- 
vision of the War Department during the last 
year of the war. This approach begins with the 
idea of content scale analysis and, in fact, is a 
direct consequence or corollary of content scale 
analysis. It affords a determination of a zero 
point—if one exists for a given problem—that 
is completely objective. It does not depend upon 
the wording of any particular question and will 
divide the population into the same two parts 
on an issue regardless of the specific questions 
used. This means that the problem of "bias" 
of question wording or question selection is 
solved in an objective manner which does not 
depend on having "experts" judge the appar- 
ent "bias" of questions. One set of questions 

be made up by a group of people with 
one particular axe to grind, and another set of 
questions with reference to the same issue could 
be made up by a group of people with the 
opposite axe to grind. No matter how these two 
sets of questions were worded, provided only 


that the two sets pertain to the same single 
issue, they will both yield the same proportion 
of people positive and negative on the issue. 

At first, it may seem like an impossibility to 
obtain such an invariant answer. That it is not 
impossible, but in fact highly plausible that 
such an answer can be obtained, will become 
clear from considerations of what is involved 
in content scale analysis and in its corollary, 
the intensity function. 


2. CONTENT SCALE ANALYSIS 


Before we can speak meaningfully about peo- 
ple being divided into positive and negative, it 
must be ascertained whether or not it is at all 
meaningful to arrange people in rank order 
along a single continuum with respect to the 
particular area being studied. This means that 
the area must be tested to see if it is scalable 
for the given population of people. The theory 
of scale analysis has been presented before to 
this Society.? Briefly, the procedure is to con- 
sider a universe of content that is to be studied, 
like attitude towards the Army, attitude towards 
one's job, attitude towards Russia, etc. etc. 
The universe is considered to be indefinitely 
large—it consists of all possible questions that 
could be asked about one's attitude towards 
the topic being studied. This universe is sam- 
pled by making up a series of questions on the 
issue. At least ten to twelve items are ordinarily 
to be used in practice for the pre-testing in a 
preliminary study. The population of people is 
also defined, and a random sample of the popu- 
lation is chosen. The responses of the sample of 
people are then observed with respect to the 
sample of questions. On the basis of these sam- 
ple data, any of several equivalent techniques 
of scale analysis can be used to test the hypoth- 
esis that the universe of content is scalable for 
the population of individuals, e.g., has a single 
content variable for all respondents. 

The Research Branch has been using scale 
analysis as a standard procedure for the past 
four years. Many areas of content have been 
found to be scalable, and very many have been 


2 Louis Guttman, “A Basis for Scaling Qualitative 
Data,” American Sociological Review, 1944, 9:189- 
150. A bibliography of published materials on scale 
and intensity analysis will be found in Louis Gutt- 
man, “Scale and Intensity Analysis for Attitude, 
Opinion, and Achievement,” to appear in the Pro- 
ceedings of the Conference of Military Psychologists 
on Methodological Contributions to Psychology to 
be published soon by the University of Maryland. 
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found not to be scalable for given populations 
of people. 


3. THE SAMPLING OF ITEMS 


For our present purposes, it is important to 
recognize the distinction between the sampling 
of items or questions and the sampling of peo- 
ple. The sampling of people can be analyzed 
according to ordinary statistical procedures since 
the sampling can be done at random (with or 
without stratification) in the strict sense of the 
term. On the other hand, for the sampling of 
items or questions a completely different prob- 
lem exists. The process of item selection or 
construction is not at all a random sampling 
process; it is a psychological process undergone 
by the research worker which, thus far, has little 
analytical theory behind it. It seems certain at 
present that ordinary sampling theory is inap- 
propriate for the problem of item selection. 
Scale analysis affords an approach to an answer 
in that its results do not depend on the partic- 
ular sample of questions used. No matter what 
sample of questions is used (provided only that 
the marginal frequencies are sufficiently differ- 
ent) the hypothesis can be tested that they are 
from a scalable universe of questions. The rea- 
son for this relative independence of the sample 
of questions is that the hypothesis of scalability 
is an extremely simple and highly restrictive 
one. If the entire universe of questions is scal- 
able, then any sample of questions must prove 
to be scalable no matter how it is chosen. 
Hence, if a sample of questions is not scalable, 
this proves the universe is not scalable. If a 
sample of questions is scalable and is large and 
diverse enough, then the inference is made that 
the universe is scalable.’ 

If a universe of content is scalable, it is mean- 
ingful to say that some people are higher than 
others on the universe and in fact to arrange 
the whole population in a rank order on the 
area from high to low. Furthermore, the rank- 
ing of the people on any sample of questions 
must be essentially that which would be ob- 
tained from the whole universe of questions. 
Any sample of items or questions from a scal- 
able universe must rank the people in essen- 
tially the same order as any other sample of 
items. Thus, the rank order of individuals from 
high to low, or from more to less favorable, for 


3 Like all inferences made about a population 
from a sample, this is, of course, not a completely 
certain inference. 


one set of attitude or opinion questions will be 
the same as the rank order from a completely 
different set of questions on the same topic. 
"This is the first invariance property that makes 
obtaining an invariant zero point possible. By 
obtaining another invariant ranking, on inten- 
sity, and relating these two invariant rankings, 
we obtain an invariant zero point. 


4. INTENSITY ANALYSIS 


If a given universe of content is not scalable, 
then it is not meaningful to order people from 
high to low, and in particular it is not meaning- 
ful to speak of people being positive or negative 
on the area. The lack of scalability indicates 
that more than one content variable is involved 
and no single ranking or division into pro and 
con can be made. 

If an area is scalable, what intrinsic proper- 
ties must a zero point have? There is meaning 
in the statement that one person is higher than 
another on the area because of the scale pattern. 
What should it mean to say that one person is 
positive and another person is negative? If a 
zero point were to be defined, in what respect 
are two people the same if one is a certain 
distance above the zero point and the other is 
the same distance below the zero point? They 
are different in that the first is higher than the 
second in the content scale ordering, but they 
are the same in their distance from the zero 
point. What shall we name this second variable 
on which they are the same? The answer pro- 
posed here is to call this second variable the 
intensity function. If two people have the same 
intensity of feeling on an issue but differ in 
their position on the content scale, then they 
must be on opposite sides of the zero point. 

If intensity of feeling goes up as one moves 
either to the right or to the left of the zero 
point, then intensity must be a U or J shaped 
function of the content scale order. The zero 
point can then be determined as that point on 
the content scale at which the intensity func- 
tion reaches its minimum; that is, as that con- 
tent point corresponding to the bottom of the 
U or J. 

The problem is to measure the intensity func- 
tion empirically. If this can be done, the zero 
point can be determined by plotting intensity 
of feeling against content score. This has been 
done by the Research Branch for many areas; 
in each scalable case, a U or J shaped curve 
emerged and a zero point was thereby approx- 
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imately determined. The technique used for 
measuring the intensity function is far from 
perfect and better suggestions will undoubtedly 
be forthcoming in the future. However, crude 
as it is, the technique does work and gives 
fairly satisfactory results. Several examples will 
be presented here of the use of this technique. 


5. ILLUSTRATIONS OF THE INTENSITY 
FUNCTION 


A survey was conducted in June, 1945, among 
a cross-section of 1,730 enlisted men in the 
United States to determine the soldier's attitude 
toward the Women's Army Corps (the WAC). 
A series of six questions was asked as a sample 
of all possible questions in the universe of "at- 
titude toward the WAC.” One of these ques- 
tions was, for example: 


(a) Suppose a girl friend of yours was consid- 
ering joining the WAC, would you advise her 
to join? 

—— Yes, I'm almost sure I would 

—— Yes, I think I would, but I'm not sure 

—— No, I probably would not 

—— No, I'm sure I would not 

—— Undecided 


In order to measure intensity of feeling, each 
of the questions was followed by another ques- 
tion [part (b)] asking: 


(b) How strongly do you feel about this? 
— Very strongly 
—— Fairly strongly 
—— Not so strongly 
—— Not at all strongly* 


The parts (a)—the content—and the parts (5) 
—the intensity—were analyzed separately for 
scalability. The six content questions proved to 
form a scale, and the intensity to form a quasi- 
scale. One essential difference between a scale 
and a quasi-scale is that from scale scores one 
can reproduce the response of each individual 
to each item (within the margin of error pro- 
vided by the coefficient of reproducibility of the 
scale), while in a quasi-scale this internal repro- 


* As will be reported in the forthcoming publica- 
tion on the work of the Research Branch, intensity 
of feeling can be measured in many different ways. 
For example, the same results were obtained by 
asking after cach attitude question, “How sure are 
you of your answer?” or “How hard was it for you 
to make this choice?” 


ducibility is not possible. An important prop- 
erty that a scale and quasi-scale do have in 
common is that each provides an invariant or- 
dering of people with respect to the sampling 
of questions, and this is all that we require for 
our present purpose. The rank order of respon- 
dents on a sample of items from a quasi-scale 
would be essentially the same rank order as that 
in the universe of all items that could have 
been used. Therefore, the ordering of the sol- 
diers on content and on intensity is essentially 
invariant with respect to the particular sample 
of six questions used. Each of the six content 
and intensity questions were dichotomized to 
produce seven ranks. Each soldier received two 
scores: a content score ranging from 0 to 6 de- 
pending upon the number of questions upon 
which he held “positive” attitudes toward the 
WAG, and an intensity score ranging from 0 
to 6 depending upon the number of answers 
about which he felt "strongly." Tabulating in- 
tensity score against content score for the 1,730 
soldiers yielded the frequency distribution shown 
in Table 2. 

If the pure intrinsic intensity function were 
being measured by the technique of using parts 
(b), then "Table 2 should show intensity as a 
perfect U or J shaped function of content. 
Table 2 shows that this is not the case, that 
there is much error in the technique; but de- 
spite the considerable amount of error the es- 
sential shape of the intensity function is 
apparent. The italicized frequency in each col- 
umn of Table 2 corresponds to the interval in 
which the median intensity lies for each content 
interval. A proper graphic presentation of the 
curve of medians is to express the data, not in 
the crude ranks observed, but in estimates of 
what the rank on the whole attitude universe 
would be if indefinitely many questions had 
been asked. The percentile metric is a way of 
doing this. The cumulative frequencies for the 
row (content) and column (intensity) marginals 
of Table 2 appear at the bottom and to the 
right of the table respectively. The last row of 
Table 2 indicates the estimated median percen- 
tile of each column. In Figure 1, the percentile 
metric is used for both content and intensity 
so that the people are considered to be arranged 
from zero to 100 per cent according to each of 
the two. Each plotted point corresponds to the 
midpoint of the interval on content and to the 
column median on intensity. 

The curve in Figure | is essentially invariant. 
If a different sample of questions than these 
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Total frequency 
Cumulative 
per cent 
Midpoint of 
content 
percentiles 
Median of 
intensity 
percentiles 


(High) 100 
90 


Intensity percentile 
ss ss 88s 8 
T 


(Low) 0 
0 10 20.30.40 99 MN 
i ve 
ars Content percentile 
Figure 1. Attitude of the enlisted men toward the 
WAC. 


particular six had been used, essentially the 
same curve would result if plotted by the per- 
centile technique.® 

In particular, the region in which the zero 
point lies is invariant. It is not possible to 
ascertain exactly where the zero point is, but 
clearly it should be between 54 and 91, the 
midpoints of the intervals surrounding the low- 


5 Several examples of similar curves and zero- 
points obtained from different samples of questions 
from the same attitude universe will be presented 
in the forthcoming publication on the work of the 
Research Branch, Despite the fact that these dif- 
ferent samples of questions had widely different 
marginal distributions, the tage of respond- 
ents to the right and left of the zero-point remained 
constant, 


TABLE 2. Attitude of Enlisted Men Toward the WAC 


Content Rank 


est point of the plotted curve. We can therefore 
say that, according to this sample of soldiers, 54 
per cent of the Army had an unfavorable at- 
titude toward the WAC; 9 per cent had a fa- 
vorable attitude toward the WAC; and the 
remaining 37 per cent were in between. As 
more and more questions are added to the scale, 
the size of the zero-range will become more 
narrow and the exact location of the zero-point 
can be determined with greater accuracy. On 
the basis of the present sample of six questions, 
the zero-point is approximately at 75 per cent. 

The fact that the curve in Figure 1 is sus- 
pended in mid-air and does not reach the bot- 
tom percentile of intensity is due to the presence 
of error in the technique for ascertaining in- 
tensity. The column medians are being plotted; 
since there is error, the medians are away from 
where they would be if there were no error. 
The essential shape of the intensity curve that 
would be obtained if there were no error seems 
rather apparent from Figure 1, and it seems 
safe to assume that the pure curve would ac- 
tually touch bottom in a region within the zero 
interval obtained from the observed curve. 

A technique for obtaining a single point 
approximation for the zero point is to use the 
median content percentile of the people lowest 
on intensity. In the perfect curve, the lowest 
person on intensity would be at the zero point 
on content. Since we have error present in prac- 
tice, we can take an average of the content 
positions of the people lowest on intensity as 
an approximation to the ideal zero point. As 
small a percentage as possible should be used 
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in order to avoid distortion, but the sample 
number of people included must be large 
enough to be reliable. If we consider only the 
bottom row in Table 2, the median content 
percentile of the 315 men therein is, by inter- 
polation, 66 per cent, and we can use this as an 
estimate of the true zero point. This estimate 
does fall in the zero interval just previously 
obtained from the U curve. 

The shape of the intensity function can vary 
considerably from problem to problem. An in- 
teresting example is afforded by a study of 
soldiers’ opinions of the Army's demobilization 
score card plan. Two areas were to be studied. 
One was opinion of the idea of the score card, 
and the other was opinion of how the plan was 
being carried out in practice. Let us consider 
this second area first. Six questions were asked 
as a sample of the universe of content. As in 
the previous example, each question was actu- 
ally in two parts; the first part concerned the 
content, and the second part elicited intensity 
of feeling. One of the questions was, for ex- 
ample: 


(a) In general, do you think the Army is trying 
its best to carry out the Army score card plan 
as it should be carried out? 
— Yes, it is trying its best 
—— It is trying some, but not hard enough 
— It is hardly trying at all 
(b) How strongly do you feel about this? 
— Not at all strongly 


—— Not so strongly 
—— Fairly strongly 
—— Very strongly 


Each part (a) of the remaining five questions 
asked opinion about how the plan was being 
carried out; each part (b) was identical with 
the above for all the questions. 

The joint distribution of content and inten- 
sity ranks for the attitude of soldiers on how 
the score card plan was being carried out is 
given in Table 3, and the approximate inten- 
sity function is plotted in Figure 2. The ex- 
pected U curve of intensity on content results 
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Figure 2. Opinion of enlisted men on the admin- 
istration of the Army score card system. 


TABLE 3. Opinion of Enlisted Men on the Administration of the 
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with the low point of the curve, the zero-point, 
falling between percentile 28 and 67 of the con- 
tent scale. There is indicated a split among 
the soldiers in their attitude toward the way 
in which the score card plan was being ad- 
ministered, with half the soldiers having a 
favorable attitude and half having an unfa- 
vorable attitude. In general the curve has a 
rather flat bottom indicating a wide zero-range, 
or area of indifference. The shape of this curve 
is quite different from the curve on attitude 
toward the WAC. 

The other aspect of the problem, the soldier's 
attitude toward the idea of the score card plan, 
produced an altogether differently shaped in- 
tensity curve. Seven questions, containing the 
content in part (a) and the intensity in part (b) 
were asked as a sample of all questions that 
could have been asked in this area. 

The joint distribution of content and inten- 
sity ranks for soldiers’ attitudes toward the idea 
of the score card plan is given in Table 4, and 
the approximate intensity function is plotted 
in Figure 3. The zero point seems to be indefi- 
nitely far to the left! This indicates that there 
was little or no unfavorable opinion about the 
idea of the score card—practically everybody 
was favorable. Such a conclusion could not be 
ascertained by looking at the marginal frequen- 
cies of the individual questions asked in this 
area. For example, one of the questions was, “In 
general, what do you think of the Army score 
card plan (the point system)?” The responses 
were as follows: 
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Figure 3. Attitude of enlisted men toward the idea 
of the Army score card plan. 


ut It is very good 
It is fairly good 
> It is not so good 
7 Itis not good at all 
5 Undecided 
100% 

It is important to notice that the technique 
of single point estimation of the zero point by 
using the content median of the lowest inten- 
sity group breaks down in this instance. Using 
the lowest intensity technique is not safe if the 
intensity function is too asymmetric. 

Illustrations of other shapes of curves are pre- 
sented in Figure 4, but further discussion will 
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Figure 4. Four examples of approximate intensity curves of attitudes of enlisted 
men. 


be reserved for the forthcoming publication of 
the Research Branch which will summarize the 
findings of its four years of work in surveying 
soldier attitudes and opinions. Briefly, we can 
say that flat-bottom U shaped curves indicate 
that there is not much difference between pros 
and cons; except for those relatively few people 
who are very extreme on either side, most of 
the people are relatively indifferent. Very sharp 
V curves indicate a clear distinction between 
being positive and being negative. 


6. THE PROBLEM OF ERROR 


The technique of asking "How strongly do 
you feel about this?" after each question is ad- 


mittedly a crude one and accounts for much of 
the error in the observed relationship between 
intensity and content. If the intrinsic intensity 
could be ascertained, the relationship should 
be perfect. How error is introduced by the tech- 
nique just described can be shown in several 
ways. Some men would say "undecided" to à 
question. When asked the part (b) about 
strength of feeling, they say they felt “very 
strongly." When they were asked why they said 
"very strongly" to part (b) when they were un- 
decided on part (a), they would answer to the 
effect that by "very strongly" they meant that 
the problem was very important, or else that 
they were very sure that they were undecided, 
or some such thing. Thus, many of the responses 
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were out of context and contributed to error 
in the frequency distribution. 

Even if all the responses were in context, 
there is still a contribution to error from the 
fact that the degrees of meaning of the words 
vary from sub-group to sub-group of the popu- 
lation. Verbal habits of people are considerably 
different. Some people will say “strongly agree” 
to almost anything when they are in favor of it, 
where other people would say “agree” under 
the same circumstances. Especially with respect 
to the intensity questions, there are people who 
say “very strongly” to every question. This ten- 
dency to use or not to use strong adjectives, we 
shall call “generalized verbal intensity.” That 
it exists as a quasi-scale for the Army popula- 
tion has been shown in several surveys, as will 
be reported in the forthcoming publication of 
the Research Branch. Generalized verbal inten- 
sity was measured by studying ten or so differ- 
ent universes of content (attitudes) in a single 
survey. One part (b) question on intensity was 
selected from each of the ten different areas at 
random, and it was found that the ten intensity 
questions formed a quasi-scale. 

It was also found that responses to individual 
questions were related to this generalized in- 
tensity, This finding emphasizes all the more 
that responses to any single content question 
must be regarded with suspicion and re-empha- 
sizes the need for an objective technique for 
obtaining a zero point such as that described 
in this paper. 

An example of how “generalized” intensity 
can be related to responses to a particular con- 
tent question is the following. The sample of 
soldiers was divided into three groups according 
to their generalized intensity: low, medium, 
and high. When asked the question, “All things 
considered, do you think the Army is run about 
as efficiently as possible, or do you think it 
could be run better?” the responses for the three 
groups turned out to be as shown in Table 5. 
The categories obviously mean different things 
to the different kinds of people; for this reason 
the interpretation of any frequency distribution 
of responses to a single question must take into 
account the verbal habits of the respondents. 

This “generalized” intensity, of course, is a 
verbal variable and reflects verbal habits of peo- 
ple; it is worth: much further exploration. A 
research study was undertaken to find out if 
this generalized verbal variable correlated with 
personality areas as investigated by conventional 
personality inventories. Four different personal- 


TABLE 5 


General Intensity 
Low Medium High 


It is run as well as 


possible 38% 32% 28% 
It could be run 
somewhat better 42 36 27 
It could be run a 
lot better 20 32 45 
Total per cent 100% 100% 100% 
Total cases 441 800 525 


ity areas were studied: inferiority, hypomania, 
psychopathic deviate, depression. All areas cor- 
related to some degree with generalized inten- 
sity. These personality variables were also 
correlated with content of specific opinion ques- 
tions. This suggests a rich field of exploration 
of the role that personality traits play in atti- 
tudes and opinions on political, social, and eco- 
nomic matters, and from a methodological point 
of view in determining respondents’ verbal hab- 
its and modes of expression. 


7. SUMMARY 


1, One purpose of intensity analysis is to 
provide an objective method of determining an 
invariant cutting point for an attitude or opin- 
ion scale. This cutting point will enable the 
research worker to divide his population into 
favorable and unfavorable groups, a division 
which will be independent of the selection or 
“bias” of the specific questions asked. 

2. The basis of intensity analysis is the theory 
of scaling which provides a test for the single 
meaning of a series of questions. Any single 
question or series of questions on an issue is 
considered as simply a sample of all other ques- 
tions on the same issue which might have been 
asked instead. The problem of scaling is to test 
whether the particular sample of questions used 
can be considered as belonging to an attitude 
or opinion universe that contains only one di- 
mension. Once a selected series of questions has 
been determined to be scalable, it is meaning- 
ful to rank the respondents from high to low 
on the attitude universe being studied. 

3. In addition to the rank order of individ- 
uals on the content or attitude scale, it is also 
possible to determine the intensity with which 
an attitude is held. This intensity measurement 
is found to be a U or J shaped function of the 
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content scale. People on both ends of the con- 
tent scale feel more strongly than people to- 
wards the middle of the scale. As one moves 
down the content scale, intensity of feeling de- 
creases until a point is reached where intensity 
of feeling begins to increase again. This point 
is invariant for any single attitude area, and 
regardless of the sample of attitude questions 
used will always divide the population into the 
same proportion with positive and negative 
opinions. Thus, this cutting point is both in- 
variant and objective. 


4. The method used in this paper for mea- 
suring intensity of feeling was to ask, “How 
strongly do you feel about this?" after each 
attitude question. Correlating the content scale 
scores with the intensity scores produced the 
expected U or J shaped curve. This technique | 
of measuring intensity is far from perfect and 
results in a large amount of variance around 
the curve. Further research will undoubtedly 
serve to reduce much of this variance. However, 
crude as it is, the present technique does work 
and has been used successfully in many instances. 


30. Response Biases and Response Sets 


J. P. GUILFORD 


By “response bias” we mean that a response to 
a test item tends to be altered in such a way 
that it indicates something other than that 
which we intended it to measure. The biases in 
which we are particularly interested here are 
usually determined by mental sets on the part 
of the examinee that are other than optimal 
for ensuring valid measurement by means of 
the item. Cronbach (1, 2) has done us a great 
service by his systematic study of response sets 
and their effects. This subject will concern us 
in the next few paragraphs. 


KINDS OF RESPONSE SETS 


The following list of response sets is essen- 
tially Cronbach’s, with some difference in ter- 
minology and one additional type of set. 

1. The set to gamble. Whenever clear-cut 
response alternatives are presented, unless E is 
told to respond to every item, there will be in- 
dividual differences in caution or lack thereof. 
Caution is negatively correlated with the right 
score to the extent that the gambler adds to his 
score by chance success. 

Chance success contributes error variance, 
and this attenuates intercorrelations. . . . The 
parameter of item difficulty has much to do 
with the problem. As item difficulty increases, 
the amount of guessing increases, and the ele- 
ment of chance success increases in importance. 
Examinees who know all the answers make per- 
fect scores, and chance success plays no role in 
them. Examinees who know all except 10 of the 
answers and who guess on the remaining 10 
items will have scores of n — 10 +e, where e 
is a chance increment. . . . Thus the more dif- 
ficult the test for the population, the larger is 
the element of chance and the lower will be 


* Excerpted from Psychometric Methods, 2nd Ed., 
McGraw-Hill, New York, 1954, with permission of 
the author and the publisher. 


the correlation between that test and anything 
else.1 

There are other aspects to the problem when 
we consider tests of temperament and interest. 
In many such tests there is a neutral category, 
such as “indifferent,” “uncertain,” or “?.” Cau- 
tion, although it may be of a different kind 
than that in tests of ability, would be expected 
to favor the use of the neutral category. Cron- 
bach reports (1) that a score based entirely on 
the number of neutral responses had a reliabil- 
ity coefficient of .73. The writer has found about 
the sa liability for number of “?” responses 
in differeht temperament inventories. Thus the 
tendency to use the neutral category is a con- 
sistent habit. There is considerable true vari- 
ance in this kind of score. Does it mean cau- 
tion and caution only? It could mean indeci- 
sion, or lack of observation, or it could mean 
a genuinely moderate amount of the trait indi- 
cated by the keyed items. It could actually mean 
that the examinee is inclined in one direction 
or another on the trait continuum for which 
the keyed score was designed. The writer has 
found that when a "?" response is correlated 
with a criterion for the trait in item analysis it 
often correlates significantly and sometimes as 
high as one of the other responses. Thus the 
reliability such as Cronbach reports for a neu- 
tral-response-bias score may be loaded very much 
with variance in common factors we want to 
measure. It cannot all be attributed to caution 
or anything else without evidence. 

2. Semantics. Where the categories of re- 
sponse are such as “agree,” “strongly disagree,” 
“dislike,” and “sometimes,” there is much room 
for individual interpretations. What one per- 
son calls "sometimes," another calls "often," 
and so on. Resulting constant errors may well 


1 This paragraph has been added to the article by 
the editor. It is taken from page 437 of Guilford's 
original text. 
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cumulate to bias scores to degrees that should 
cause concern. Very little has been done to es- 
tablish just how much difference in quantitative 
meaning such descriptive terms entail or how 
much they affect scores. The fact that we ob- 
tain high reliability coefficients of scores is no 
assurance of their lack of consequence, for these 
personal deviations in interpretation probably 
contribute to true variance. They would detract 
from validity. 

We encounter the same kind of difficulty in 
the method of constant stimuli when doubtful 
judgments are used, and we find it in the use 
of rating scales where verbal cues are used to 
guide the rater. No doubt any method that will 
serve to stabilize individual interpretations of 
words will go a long way to improve psycho- 
logical measurements of different kinds. 

3. Impulsion. An "impulsion" set is seen 
operating when there is the alternative for E to 
respond or not to respond; to mark or not to 
mark. This situation occurs in multiple-choice 
tests in which, given a model object, E is to 
mark all those in a list that resemble it or are 
identical with it. Es differ in the number of 
marks that they make. The more marks they 
make, the greater their chance of hitting the 
right ones. Unless there is a penalty for errors 
of commission the score favors the liberal 
marker. The same kind of bias occurs in using 
check lists, where each item is to be marked or 
not marked. It occurs in essay examinations in 
which the amount written weighs heavily in the 
mark given. 

1. Acquiescence. This is the error of the “yes 
man,” at least for the “yes man" of a type. The 
set is to favor affirmative responses over nega- 
tive responses. There may be a few negativistic 
persons who are inclined in the opposite direc- 
tion. The point here is that each person has 
some general disposition on this continuum, 
positive or negative. It is found, however, that 
more individuals give an excess number of 
"true" responses in true-false tests, regardless of 
the actual composition of the test. This has 
several interesting consequences. For example, 
the poor student who guesses a great deal makes 
à good score on the true items but a very poor 
score on the false items. Sometimes the score 
based on only the false items correlates ex- 
tremely high with that on the total test. 

The same kind of set may be found in tests 
answered by "Yes" and "No." The number of 
"Yes" responses may be excessive on some pa- 
pers. This may represent the actual situation 


with respect to this person's status in the traits 
that are scored, but it may also represent dis- 
tortions due to the acquiescence set. 

5. Speed versus accuracy. This is a long recog- 
nized disturber that has received much atten- 
tion. In the practice of group testing it is 
sometimes essential, or at least it is very con- 
venient, for every examinee to be allotted the 
same limited working time on a test. This fact, 
particularly, has raised many questions concern- 
ing the effect of working time upon scores and 
upon measurement. ... There are . . . funda- 
mental questions concerning what psychological 
qualities are measured when time is liberal ver- 
sus when time is short. . . . The number (of 
items) attempted means different things for dif- 
ferent individuals. Some examinees make no 
responses until they feel confident of the an- 
swer, while others record an answer even when 
they know they are guessing. In a speed test, 
too, differences in motivation level may have 
an important bearing on the number of answers 
recorded. Thus, speed conditions where items 
are not very easy open the door to many uncon- 
trolled determiners of individual differences in 
scores. It appears that speed of work is a rela- 
tively stable personal trait somewhat related to 
kind of task but not fully determined by kind 
of task or level of difficulty. The psychological 
nature of these speed factors is unknown, but it 
is a good hypothesis that they represent moti- 
vational and temperament variables as well as 
or instead of abilities.? 

6. Falsification. Since the time when temper- 
ament and interest inventories became popular, 
we have had a new source of bias recognized 
and identified by various terms such as faking- 
good and sophistication as well as the term 
chosen here, falsification. The set is motivated by 
the desire to make a good score or to make a 
good appearance and to cover up defects and de- 
ficiencies. If the examinee has any idea about the 
kinds of traits being measured, as is more likely 
to be true of vocational-interest inventories, it 
has been shown that he can manage to increase 
certain scores by a judicious choice of responses, 
without regard to whether those responses de- 
scribe him or not. In temperament inventories 
he is not so likely to guess successfully what traits 
are being scored, but as he comes to each item he 
may have his own ideas as to which response is 


2 This paragraph has been revised by the editor. 
To a large extent, it has been taken from the 
discussion on pages 365-370 of Guilford’s original 
text. 


more desirable to him or to his examiner. If 
his ideas of desirability are correlated with any 
trait key, he can thus increase that score. If his 
ideas are at random with respect to any trait 
key, he may not change his score, but he has 
added much chance variance to his total scores. 
If he is applying for or competing for a certain 
assignment, he will probably think what are the 
best responses to make to each item in the light 
of this assignment. He may be very mistaken 
about the way in which items are actually keyed 
and might make a poorer score rather than a 
better one; yet the score he makes does not 
describe him as it should and is therefore 
biased. 

Many studies have shown that under obvious 
inducements much biasing of responses and 
of scores can occur, but no crucial study has 
been made showing how much actually does 
occur under normal testing conditions. It must 
not be supposed that all examinees attempt 
to make a good impression, for many are 
honest. There are some who actually “lean 
over backwards” when reporting upon them- 
selves; there are the soul-searching ones and 
self-flagellating ones; and there are those who 
want to appear mentally incapacitated. The 
user of inventories must be ever alert to the 
occasions for any of these biases, positive or 
negative. 


SOME PRINCIPLES OF RESPONSE SETS 


Before we attempt to suggest corrective mea- 
sures for response sets, it is desirable to extract 
some generalizations. The sources of response 
sets, like those for sets in general, are varied. 
Some of the sources in this connection will 
be implied in what follows. 

1. Sets are consistent and persistent. While 
some sets of an examinee may shift from test 
to test and even from item to item, others rep- 
resent apparently enduring qualities that can 
be called personality traits. Some are very per- 
sistent over time, and some are consistent from 
one test to another and from one administra- 
tion to another of the same test. To the extent 
that consistency occurs, we find that the sets 
contribute to increased reliability of total 
scores, In other words, within the true variance 
of a test, perhaps even among its common- 
factor variances, we may find a contribution 
from one or more of these sets. It is this fact 
that makes possible the use of tests of ability, 
which are more objective, to provide, as by- 
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products, scores for temperament traits, Un- 
fortunately, such variance probably occupies a 
small part of the total-score variance and can- 
not therefore give an unadulterated index of 
status in a temperament trait. Perhaps with 
intentional cultivation this “bias” variance 
can in some tests be augmented and made to 
do duty as a measure of temperament. 

2. Response sets make scores more ambig- 
uous. To the extent that response sets con- 
tribute to the true variance of a test they are 
probably doing so at some expense of the 
common-factor variance that we intended to 
bring out in our measurements. Even if there is 
merely addition of more true variance, it alters 
the meaning of the score and of its interpre- 
tation in use. We should therefore strive to 
get rid of such biases even if to do so lowers 
reliability. The test will be a better diagnostic 
instrument even if the bias variance is replaced 
by error variance, because the latter is not fo- 
cused systematically in any direction while the 
bias variance is. 

3. Response sets operate most in ambiguous 
and unstructured situations. When the instruc- 
tions leave too much to the imagination of the 
examinee, he invents his own goal and his own 
task. If each examinee has his own goal and 
task and if these differ among examinees, we 
have lost the experimental conditions neces- 
sary for meaningful scores. We hear much 
about the virtues of unstructured tests for diag- 
nostic purposes. Since unstructured tests open 
very widely the way for personal response sets, 
and since personal sets lead to ambiguous scores 
and interpretations, the conclusion is inevitable 
that the “scores” from unstructured tests are 
largely uninterpretable. There is a sense in 
which the unstructured test can be of use, and 
that is in the direction of the measurement of 
the sources of response sets when these sources 
are personality traits. But there must be a struc- 
turing of some kind to make the outcome mean 
anything at all univocal concerning the par- 
ticular trait we want to measure. In throwing 
away some of the controls in order to give 
response sets free play, the projective tester 
has thrown away controls that he very much 
needs for interpretable measurement. 

4. Difficult tests open the way to response 
sets. In the multiple-choice test it is when the 
items are difficult for the examinee that he re- 
sorts most to response sets. This parallels very 
closely the situation in psychophysical judg- 
ment. When differences in stimuli become so 
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small that it all seems like guesswork, the ob- 
server falls into habits that determine his se- 
quences of judgments. Since the same test is 
easy for some and difficult for others, this 
poses a special problem, but it is not an insol- 
uble problem. 


THE CONTROL OF RESPONSE SETS 


Some of the facts concerning response sets 
and the principles just stated almost suggest 
their own remedies. Some of the specific steps 
that may be taken to counteract them and 
their effects are suggested below. 

1. Identify the set. We are much better pre- 
pared to do something about a disturbing re- 
sponse set if we can identify it and can know 
that it is present. There are various ways of 
finding indications that a certain suspected set 
is present. Bias scores can be developed and 
some have been in use. The Humm “no-count” 
score, which is the number of "No" responses 
given to the Humm-Wadsworth Temperament 
Scale, is used to indicate that the individual is 
attempting to give "good" answers rather than 
real answers. For the great majority of the 
items in that scale the answer "Yes" is in the 
pathological direction. With the Minnesota 
Multiphasic Personality Inventory there are a 
number of so-called validating scores used to 
indicate various types of departures from cor- 
rect reporting on the part of the examinee. 
It would be important to factor analyze such 
scores to determine whether they are actually 
measuring what they are purported to measure, 
or to validate them in some other way. 

2. Structure the test sufficiently. Administer- 
ing a test is like conducting an experiment. 
If the outcome is to have meaning, experi- 
mental controls must be exerted. There is no 
escaping this scientific requirement. The admin- 
istrator should set the goal and define the task 
for all examinees. This may be accomplished 
by writing better instructions. It might even 
be well to warn E against certain biases and 
their unfortunate effects. In administering 
check lists, one might instruct E to mark a 
stated number of items, or might set upper 
and lower limts for him. 

3. Use good test forms. Some item forms are 
more subject to biases from sets than others. 
The multiple-choice form in which E selects 
one right answer only and in which the alter- 
natives are objectively interpretable seems to 
be best. Yet, even in multiple-choice tests there 


have been indications of bias, in the form of 
slight preference for some answer positions 
over others, especially for low-scoring individ- 
uals. Mosier and Price (6) have provided a sys- 
tem for randomizing the position of the right 
answer, also of the wrong answers. This should 
go far to prevent the test maker from hitting, 
by a position habit of his own, a response 
pattern. that does or does not coincide with 
the examinee's habits. 

Cronbach (2) recommends avoidance of cer- 
tain item forms including the “yes-no” choice, 
the "same-different" choice, the “mark-or-not-to- 
mark" distinction, the categories "like," "in- 
different and "dislike" and of the neutral 
category in general. The writer can agree that 
this would remove much of the trouble with 
response biases, but it would also remove many 
useful tests because at present there are no 
good substitutes for some of these response 
categories. He has not found the indifference 
category so troublesome if it is given an em- 
pirical weight, as indicated earlier. 

The forced-choice type of item is also some- 
times recommended as a solution to some of 
the bias problems. Gordon (3, 4) has found 
slightly higher reliability and validity for in- 
ventory scores obtained from forced-choice 
items than from the usual type. The writer 
believes, however, that there are better ways 
of dealing with the same problems and that 
the forced-choice device introduces some mea- 
surement problems that may be worse than 
those they were intended to correct. 

4. Make multiple-choice tests sufficiently easy. 
This advice has been emphasized a number of 
times. Where a population has considerable 
variability in the ability tested, it might even 
be wise to develop two or three tests at dif- 
ferent levels of ability. Some preliminary rough 
testing would determine at which level each 
individual should be tested and the appropriate 
test would then be applied to him. By scaling 
procedures the scores from the three tests could 
be brought to a common scale. The use of 
limen scores also suggests itself in this connec- 
tion. 

5. Use a good scoring formula. After consid- 
ering the nature of the test and the score dis- 
tributions for rights, wrongs, and unattempted 
items, a favorable a priori formula may be 
chosen. The best solution would be to obtain 
factor-analysis information concerning each of 
these scores and to weight them accordingly. 


Another good solution, if a known criterion 
is to be predicted, is to derive optimal weights 
using formula (15.27). 

6. Use suppressors to remove effects of bias. 
Knowing what kind of bias is present, not hav- 
ing been able to prevent it, we may resort to 
a corrective procedure. This is to use a “sup- 
pressor" score to attempt to negate the bias 
variance in the score. In an inventory in which 
there may be successful falsification, each trait 
score might be summed with a "lie" score (the 
latter with appropriate negative weight) to 
lower the bias variance. Since this is a differ- 
ence score, the suppression as well as the trait 
score should be of high reliability. Even then 
the difference score will be less reliable than 
the trait score, but since it is more univocal, 
it is more interpretable. The same suppression 
score should not be used in combination with 
all trait scores, since this would introduce cor- 
relation among the corrected trait scores due 
to a common element. Levine (5) has made the 
very pertinent suggestion that suppression 
items be planted in the inventory for use 
where suppression is needed. They should be 
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item analyzed and those correlating low with 
the criterion but high with the other trait 
items should be selected to provide the sup- 
pressors for that trait. 

7. Refrain from using tests where biasing sets 
are invalidating. One may as well recognize the 
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31. Convergent and Discriminant Validation by the 
Multitrait-Multimethod Matrix 


DONALD T. CAMPBELL and DONALD W. FISKE 


In the cumulative experience with measures 
of individual differences over the past 50 years, 
tests have been accepted as valid or discarded 
as invalid by research experiences of many 
sorts. The criteria suggested in this paper are 
all to be found in such cumulative evaluations, 
as well as in the recent discussions of validity. 
These criteria are clarified and implemented 
when considered jointly in the context of a 
multitrait-multimethod matrix. Aspects of the 
validational process receiving particular em- 
phasis are these: 


l. Validation is typically convergent, a con- 
firmation by independent measurement proce- 
dures. Independence of methods is a common 
denominator among the major types of validity 
(excepting content validity) insofar as they are 
to be distinguished from reliability. 

2. For the justification of novel trait mea- 
sures, for the validation of test interpretation, 
or for the establishment of construct validity, 
discriminant validation as well as convergent 
validation is required. Tests can be invalidated 
by too high correlations with other tests from 
which they were intended to differ. 

3. Each test or task employed for measure- 
ment purposes is a trait-method unit, a union 
of a particular trait content with measurement 
procedures not specific to that content. The 
systematic variance among test scores can be 
due to responses to the measurement features 
as well as responses to the trait content. 

4. In order to examine discriminant valid- 


* Abridged from Psychological Bulletin, 1959, 56, 
81-105, with permission of the senior author and 
ee — Psychological Association. The new 
ta analyses reported in this were supported 
by funds from the Grades Scheel of North- 
western University and by the Department of 
Psychology of the University of Chicago. 


ity, and in order to estimate the relative con- 
tributions of trait and method variance, more 
than one trait as well as more than one method 
must be employed in the validation process. 
In many instances it will be convenient to 
achieve this through a multitrait-multimethod 
matrix. Such a matrix presents all of the inter 
correlations resulting when each of several 
traits is measured by each of several methods. 


To illustrate the suggested validational proc- 
ess, a synthetic example is presented in Table 
1, This illustration involves three different 
traits, each measured by three methods, gener- 
ating nine separate variables. It will be con- 
venient to have labels for various regions of 
the matrix, and such have been provided in 
Table 1. The reliabilities will be spoken of 
in terms of three reliability diagonals, one for 
each method. The reliabilities could also be 
designated as the monotrait-monomethod val- 
ues, Adjacent to each reliability diagonal is 
the heterotrait-monomethod triangle. The reli- 
ability diagonal and the adjacent heterotrait 
monomethod triangle make up a monomethod 
block. A heteromethod block is made up of a 
validity diagonal (which could also be desig- 
nated as monotrait-heteromethod values) and 
the two heterotrait-heteromethod triangles ly- 
ing on each side of it. Note that these two 
heterotrait-heteromethod triangles are not iden- 
tical. 

In terms of this diagram, four aspects bear 
upon the question of validity. In the first place, 
the entries in the validity diagonal should be 
significantly different from zero and sufficiently 
large to encourage further examination of 
validity. This requirement is evidence of con- 
vergent validity. Second, a validity diagonal 
value should be higher than the values lying 
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TABLE 1. A Synthetic Multitrait-Multimethod Matrix 


Method 1 
Traits Ay B, Gi 
Ay (.89) 
Method 1 B, (89) 
Ci (76) 
AQ Q97 922 09] 
1 SN I i 
Method 2 Bo | 29 ET 0 1 
I SAC aa 
C2 La —d1 7.46 
AU ee" SGT] 
rss dpi | 
Method 3 — Bs — !.23 SS 68-512 I 
I ad S 
€; | 179.45 


Method 2 Method 3 
B; C; As B, C3 
(94) 
(.92) 
60 (.85) 


Note. The validity diagonals are the three sets of italicized values. The reliability 
diagonals are the three sets of values in parentheses. Each heterotrait-monomethod 
triangle is enclosed by a solid line. Each heterotrait-heteromethod triangle is en- 


closed by a broken line. 


in its column and row in the heterotrait- 
heteromethod triangles. That is, a validity 
value for a variable should be higher than the 
correlations obtained between that variable 
and any other variable having neither trait 
nor method in common. This requirement may 
seem so minimal and so obvious as to not need 
stating, yet an inspection of the literature 
shows that it is frequently not met, and may 
not be met even when the validity coefficients 
are of substantial size. In Table 1, all of the 
validity values meet this requirement. A third 
common-sense desideratum is that a variable 
correlate higher with an independent effort to 
measure the same trait than with measures de- 
signed to get at different traits which happen 
to employ the same method. For a given vari- 
able, this involves comparing its values in the 
validity diagonals with its values in the hetero- 
trait-monomethod triangles. For variables A}, 
B,, and C,, this requirement is met to some 
degree, For the other variables, As, Ag ètc., it 
is not met and this is probably typical of the 
usual case in individual differences research, 
as will be discussed in what follows. A fourth 
desideratum is that the same pattern of trait 
interrelationship be shown in all of the hetero- 
trait triangles of both the monomethod and 
heteromethod blocks. The hypothetical data in 
Table 1 meet this requirement to a very 


marked degree, in spite of the different general 
levels of correlation involved in the several 
heterotrait triangles. The last three criteria pro- 
vide evidence for discriminant validity. 

Before examining the multitrait-multimethod 
matrices available in the literature, some ex- 
plication and justification of this complex of 
requirements seems in order. 


Convergence of Independent Methods: The 
Distinction Between Reliability and Validity 


Both reliability and validity concepts require 
that agreement between measures be demon- 
strated. A common denominator which most 
validity concepts share in contradistinction to 
reliability is that this agreement represent the 
convergence of independent approaches. The 
concept of independence is indicated by such 
phrases as “external variable,” “criterion per- 
formance,” “behavioral criterion” (American 
Psychological Association, 1954, pp. 13-15) used 
in connection with concurrent and predictive 
validity. For construct validity it has been 
stated thus: "Numerous successful predictions 
dealing with phenotypically diverse ‘criteria’ 
give greater weight to the claim of construct 
validity than do . . . predictions involving 
very similar behavior" (Cronbach and Meehl, 
1955, p. 295). The importance of independence 
recurs in most discussions of proof. For exam- 
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ple, Ayer discussing a historian's belief about 
a past event, says “if these sources are numer- 
ous and independent, and if they agree with 
one another, he will be reasonably confident 
that their account of the matter is correct” 
(Ayer, 1954, p. 39). In discussing the manner 
in which abstract scientific concepts are tied 
to operations, Feigl speaks of their being 
"fixed" by "triangulation in logical space" 
(Feigl, 1958, p. 401). 

Independence is, of course, a matter of de- 
gree, and in this sense, a reliability and validity 
can be seen as regions on a continuum. (Cf. 
Thurstone, 1937, pp. 102-103.) Reliability is 
the agreement between two efforts to measure 
the same trait through maximally similar 
methods. Validity is represented in the agree- 
ment between two attempts to measure the 
same trait through maximally different meth- 
ods. A split-half reliability is a little more like 
a validity coefficient than is an immediate test- 
retest reliability, for the items are not quite 
identical. A correlation between dissimilar 
subtests is probably a reliability measure, but 
is still closer to the region called validity. 

Some evaluation of validity can take place 
even if the two methods are not entirely inde- 
pendent. In Table 1, for example, it is possible 
that Methods 1 and 2 are not entirely inde- 
pendent. If underlying Traits A and B are 
entirely independent, then the .10 minimum 
correlation in the heterotrait-heteromethod tri- 
angles may reflect method covariance. What if 
the overlap of method variance were higher? 
All correlations in the heteromethod block 
would then be elevated, including the validity 
diagonal. The heteromethod block involving 
Methods 2 and 3 in Table 1 illustrates this. 
The degree of elevation of the validity diag- 
onal above the heterotrait-heteromethod trian- 
gles remains comparable and relative validity 
can still be evaluated. The interpretation of 
the validity diagonal in an absolute fashion 
requires the fortunate coincidence of both an 
independence of traits and an independence 
of methods, represented by zero values in the 
heterotrait-heteromethod triangles. But zero 
values could also occur through a combination 
of negative correlation between traits and posi- 
tive correlation between methods, or the re- 
verse. In practice, perhaps all that can be hoped 
for is evidence for relative validity, that is, for 
common variance specific to a trait, above and 
beyond shared method variance. 


Discriminant Validation 


While the usual reason for the judgment of 
invalidity is low correlations in the validity 
diagonal [e.g., the Downey Will-Temperament 
Test (Symonds, 1931, p. 337ff)] tests have also 
been invalidated because of too high correla- 
tions with other tests purporting to measure 
different things. The classic case of the social 
intelligence tests is a case in point. [See below 
and also (Strang, 1930; R. Thorndike, 1936).] 
Such invalidation occurs when values in the 
heterotrait-heteromethod triangles are as high 
as those in the validity diagonal, or even where 
within a monomethod block, the heterotrait 
values are as high as the reliabilities. Loevin- 
ger, Gleser, and DuBois (1953) have emphasized 
this requirement in the development of maxi- 
mally discriminating subtests. 

When a dimension of personality is hy- 
pothesized, when a construct is proposed, the 
proponent invariably has in mind distinctions 
between the new dimension and other con- 
structs already in use. One cannot define with- 
out implying distinctions, and the verification 
of these distinctions is an important part of 
the validational process. In discussions of con- 
struct validity, it has been expressed in such 
terms as “from this point of view, a low corre- 
lation with athletic ability may be just as im- 
portant and encouraging as a high correlation 
with reading comprehension” (APA, 1954, p. 
17). 


The Test as a Trait-Method Unit 


In any given psychological measuring device, 
there are certain features or stimuli introduced 
specifically to represent the trait that it is 
intended to measure, There are other features 
which are characteristic of the method being 
employed, features which could also be present 
in efforts to measure other quite different traits. 
The test, or rating scale, or other device, al- 
most inevitably elicits systematic variance in 
response due to both groups of features. To 
the extent that irrelevant method variance con- 
tributes to the scores obtained, these scores are 
invalid. 

This source of invalidity was first noted in 
the "halo effects" found in ratings (Thorndike, 
1920). Studies of individual differences among 
laboratory animals resulted in the recognition 
of "apparatus factors," usually more dominant 
than psychological process factors (Tryon, 1942). 
For paper-and-pencil tests, methods variance 
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has been noted under such terms as “test-form 
factors" (Vernon: 1957, 1958) and “response 
sets" (Cronbach: 1946, 1950; Lorge, 1937). 
Cronbach has stated the point particularly 
clearly: “The assumption is generally made . . . 
that what the test measures is determined by 
the content of the items. Yet the final score 

. is a composite of effects resulting from the 
content of the item and effects resulting from 
the form of the item used" (Cronbach, 1946, 
p. 475). "Response sets always lower the logical 
validity of a test. . . . Response sets interfere 
with inferences from test data" (p. 484). 

While E. L. Thorndike (1920) was willing to 
allege the presence of halo effects by compar- 
ing the high obtained correlations with com- 
mon sense notions of what they ought to be 
(e.g., it was unreasonable that a teacher's intel- 
ligence and voice quality should correlate .63) 
and while much of the evidence of response 
set variance is of the same order, the clear-cut 
demonstration of the presence of method vari- 
ance requires both several traits and several 
methods. Otherwise, high correlations between 
tests might be explained as due either to basic 
trait similarity or to shared method variance. 
In the multitrait-multimethod matrix, the 
presence of method variance is indicated by 
the difference in level of correlation between 
the parallel values of the monomethod block 
and the heteromethod blocks, assuming com- 
parable reliabilities among all tests. Thus the 
contribution of method variance in Test A, 
of Table 1 is indicated by the elevation of 
TAB above TAB) ie. the difference between 
.51 and .22, etc. 

The distinction between trait and method 
is of course relative to the test constructor's 
intent. What is an unwanted response set for 
one tester may be a trait for another who 
wishes to measure acquiescence, willingness to 
take an extreme stand, or tendency to attrib- 
ute socially desirable attributes to oneself 
(Cronbach: 1946, 1950; Edwards, 1957; Lorge, 
1937). 


DISCUSSION 


Relation to Construct Validity 


While the validational criteria presented are 
explicit or implicit in the discussions of con- 
struct validity (Cronbach and Meehl, 1955; 
APA, 1954), this paper is primarily concerned 


with the adequacy of tests as measures of a 
construct rather than with the adequacy of a 
construct as determined by the confirmation 
of theoretically predicted associations with 
measures of other constructs. We believe that 
before one can test the relationships between 
a specific trait and other traits, one must have 
some confidence in one’s measures of that trait. 
Such confidence can be supported by evidence 
of convergent and discriminant validation. 
Stated in different words, any conceptual for- 
mulation of trait will usually include implicitly 
the proposition that this trait is a response 
tendency which can be observed under more 
than one experimental condition and that this 
trait can be meaningfully differentiated from 
other traits. The testing of these two proposi- 
tions must be prior to the testing of other 
propositions to prevent the acceptance of erro- 
neous conclusions. For example, a conceptual 
framework might postulate a large correlation 
between Traits A and B and no correlation 
between Traits A and C. If the experimenter 
then measures A and B by one method (e.g., 
questionnaire) and C by another method (such 
as the measurement of overt behavior in a 
situation test), his findings may be consistent . 
with his hypotheses solely as a function of 
method variance common to his measures of 
A and B but not to C. 

The requirements of this paper are intended 
to be as appropriate to the relatively atheoret- 
ical efforts typical of the tests and measurements 
field as to more theoretical efforts. This empha- 
sis on validational criteria appropriate to our 
present atheoretical level of test construction is 
not at all incompatible with a recognition of 
the desirability of increasing the extent to 
which all aspects of a test and the testing situa- 
tion are determined by explicit theoretical con- 
siderations, as Jessor and Hammond have 
advocated (Jessor and Hammond, 1957). 


Relation to Operationalism 


Underwood (1957, p. 54) in his effective pre- 
sentation of the operationalist point of view 
shows a realistic awareness of the amorphous 
type of theory with which most psychologists 
work. He contrasts a psychologist's "literary" 
conception with the latter's operational defini- 
tion as represented by his test or other measur- 
ing instrument. He recognizes the importance 
of the literary definition in communicating and 
generating science. He cautions that the opera- 
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tional definition "may not at all measure the 
process he wishes to measure; it may measure 
something quite different” (1957, p. 55). He 
does not, however, indicate how one would 
know when one was thus mistaken. 

The requirements of the present paper may 
be seen as an extension of the kind of opera- 
tionalism Underwood has expressed. The test 
constructor is asked to generate from his liter- 
ary conception or private construct not one 
operational embodiment, but two or more, each 
as different in research vehicle as possible. Fur- 
thermore, he is asked to make explicit the dis- 
tinction between his new variable and other 
variables, distinctions which are almost certainly 
implied in his literary definition. In his very 
first validational efforts, before he ever rushes 
into print, he is asked to apply the several 
methods and several traits jointly. His literary 
definition, his conception, is now best repre- 
sented in what his independent measures of the 
trait hold distinctively in common. The multi- 
trait-multimethod matrix is, we believe, an im- 
portant practical first step in avoiding “the 
danger . . . that the investigator will fall into 
the trap of thinking that because he went from 
an artistic or literary conception . . . to the 
construction of items for a scale to measure it, 
he has validated his artistic conception" (Un- 
derwood, 1957, p. 55). In contrast with the sin- 
gle operationalism now dominant in psychology, 
we are advocating a multiple operationalism, a 
convergent operationalism (Garner, 1954; Gar- 
ner, Hake, and Eriksen, 1956), a methodolog- 
ical triangulation (Campbell, 1958, 1956), an 
operational delineation (Campbell, 1954), a con- 
vergent validation. 

Underwood's presentation and that of this 
paper as a whole imply moving from concept 
to operation, a sequence that is frequent in 
science, and perhaps typical. The same point 
can be made, however, in inspecting a transi- 
tion from operation to construct. For any body 
of data taken from a single operation, there is 
a subinfinity of interpretations possible; a sub- 
infinity of concepts, or combinations of con- 
cepts, that it could represent. Any single opera- 
tion, as representative of concepts, is equivocal. 
In an analogous fashion, when we view the 
Ames distorted room from a fixed point and 
through a síngle eye, the data of the retinal 
pattern are equivocal, in that a subinfinity of 
hexahedrons could generate the same pattern. 
The addition of a second viewpoint, as through 
binocular parallax, greatly reduces this equiv- 


ocality, greatly limits the constructs that could 
jointly account for both sets of data. In Gar- 
ner's (1954) study, the fractionation measures 
from a single method were equivocal—they 
could have been a function of the stimulus dis- 
tance being fractionated, or they could have 
been a function of the comparison stimuli used 
in the judgment proces. A multiple, conver- 
gent operationalism reduced this equivocality, 
showing the latter conceptualization to be the 
appropriate one, and revealing a preponder- 
ance of methods variance. Similarly for learn- 
ing studies: in identifying constructs with the 
response data from animals in a specific opera- 
tional setup there is equivocality which can 
operationally be reduced by introducing trans- 
position tests, different operations so designed 
as to put to comparison the rival conceptual- 
izations (Campbell, 1954). 

Garner's convergent operationalism and our 
insistence on more than one method for mea- 
suring each concept depart from Bridgman’s 
early position that “if we have more than one 
set of operations, we have more than one con- 
cept, and strictly there should be a separate 
name to correspond to each different set of 
operations” (Bridgman, 1927, p. 10). At the cur- 
rent stage of psychological progress, the crucial 
requirement is the demonstration of some con- 
vergence, not complete congruence, between 
two distinct sets of operations. With only one 
method, one has no way of distinguishing trait 
variance from unwanted method variance. 
When psychological measurement and concep- 
tualization become better developed, it may 
well be appropriate to differentiate conceptu- 
ally between Trait-Method Unit A, and Trait- 
Method Unit Ag, in which Trait A is measured 
by different methods. More likely, what we have 
called method variance will be specified theo- 
retically in terms of a set of constructs. (This 
has in effect been illustrated in the discussion 
above in which it was noted that the response 
set variance might be viewed as trait variance.) 
It will then be recognized that measurement 
procedures usually involve several theoretical 
constructs in joint application. Using obtained 
measurements to estimate values for a single 
construct under this condition still requires 
comparison of complex measures varying in 
their trait composition, in something like a 
multitrait-multimethod matrix. Mill's joint 
method of similarities and differences still epit- 
omizes much about the effective experimental 
clarification of concepts. 


VALIDATION BY MULTITRAIT-MULTIMETHOD MATRIX 287 


The Evaluation of a Multitrait- 
Multimethod Matrix 


The evaluation of the correlation matrix 
formed by intercorrelating several trait-method 
units must take into consideration the many 
factors which are known to affect the magnitude 
of correlations. A value in the validity diagonal 
must be assessed in the light of the reliabilities 
of the two measures involved: e.g., a low reli- 
ability for Test A, might exaggerate the appar- 
ent method variance in Test A,. Again, the 
whole approach assumes adequate sampling of 
individuals: the curtailment of the sample with 
respect to one or more traits will depress the 
reliability coefficients and intercorrelations in- 
volving these traits. While restrictions of range 
over all traits produces serious difficulties in the 
interpretation of a multitrait-multimethod ma- 
trix and should be avoided whenever possible, 
the presence of different degrees of restriction 
on different traits is the more serious hazard to 
meaningful interpretation. 

Various statistical treatments for multitrait- 
multimethod matrices might be developed. We 
have considered rough tests for the elevation 
of a value in the validity diagonal above the 
comparison values in its row and column. Cor- 
relations between the columns for variables 
measuring the same trait, variance analyses, and 
factor analyses have been proposed to us. How- 
ever, the development of such statistical meth- 
ods is beyond the scope of this paper. We 
believe that such summary statistics are neither 
necessary nor appropriate at this time. Psychol- 
ogists today should be concerned not with eval- 
uating tests as if the tests were fixed and 
definitive, but rather with developing better 
tests. We believe that a careful examination of 
a multitrait-multimethod matrix will indicate 
to the experimenter what his next steps should 
be: it will indicate which methods should be 
discarded or replaced, which concepts need 
sharper delineation, and which concepts are 
poorly measured because of excessive or con- 
founding method variance. Validity judgments 
based on such a matrix must take into account 
the stage of development of the constructs, the 
postulated relationships among them, the level 
of technical refinement of the methods, the rel- 
ative independence of the methods, and any 
pertinent characteristics of the sample of Ss. 
We are proposing that the validational process 
be viewed as an aspect of an ongoing program 
for improving measuring procedures and that 
the “validity coefficients” obtained at any one 


stage in the process be interpreted in terms of 
gains over preceding stages and as indicators of 
where further effort is needed. 


The Design of a Multitrait- 
Multimethod Matrix 


The several methods and traits included in a 
validational matrix should be selected with care. 
The several methods used to measure each trait 
should be appropriate to the trait as concep- 
tualized. Although this view will reduce the 
range of suitable methods, it will rarely restrict 
the measurement to one operational procedure. 

Wherever possible, the several methods in 
one matrix should be completely independent 
of each other: there should be no prior reason 
for believing that they share method variance. 
This requirement is necessary to permit the 
values in the heteromethod-heterotrait triangles 
to approach zero. If the nature of the trait rules 
out such independence of methods, efforts 
should be made to obtain as much diversity as 
possible in terms of data-sources and classifica- 
tion processes. Thus, the classes of stimuli or 
the background situations, the experimental 
contexts, should be different. Again, the per- 
sons providing the observations should have 
different roles or the procedures for scoring 
should be varied. 

Plans for a validational matrix should take 
into account the difference between the in- 
terpretations regarding convergence and dis- 
crimination. It is sufficient to demonstrate 
convergence between two clearly distinct meth- 
ods which show little overlap in the heterotrait- 
heteromethod triangles. While agreement be- 
tween several methods is desirable, convergence 
between two is a satisfactory minimal require- 
ment. Discriminative validation is not so easily 
achieved. Just as it is impossible to prove the 
null hypothesis, or that some object does not 
exist, so one can never establish that a trait, as 
measured, is differentiated from all other traits. 
One can only show that this measure of Trait 
A has little overlap with those measures of B 
and C, and no dependable generalization be- 
yond B and C can be made, For example, social 
poise could probably be readily discriminated 
from aesthetic interests, but it should also be 
differentiated from leadership. 

Insofar as the traits are related and are ex- 
pected to correlate with each other, the mono- 
method correlations will be substantial and 
heteromethod correlations between traits will 
also be positive. For ease of interpretation, it 


288 ATTITUDE MEASUREMENT 


may be best to include in the matrix at least 
two traits, and preferably two sets of traits, 
which are postulated to be independent of each 
other. 

In closing, a word of caution is needed. Many 
multitrait-multimethod matrices will show no 
convergent validation: no relationship may be 
found between two methods of measuring a 
trait. In this common situation, the experi- 
menter should examine the evidence in favor 
of several alternative propositions: (a) Neither 
method is adequate for measuring the trait; (b) 
One of the two methods does not really measure 
the trait. (When the evidence indicates that a 
method does not measure the postulated trait, 
it may prove to measure some other trait. High 
correlations in the heterotrait-heteromethod tri- 
angles may provide hints to such possibilities.) 
(c) The trait is not a functional unity, the re- 


SUMMARY 


This paper advocates a validational process 
utilizing a matrix of intercorrelations among 
tests representing at least two traits, each mea- 
sured by at least two methods. Measures of the 
same trait should correlate higher with each 
other than they do with measures of different 
traits involving separate methods. Ideally, these 
validity values should also be higher than the 
correlations among different traits measured by 
the same method. 

Illustrations from the literature show that 
these desirable conditions, as a set, are rarely 
met. Method or apparatus factors make very 
large contributions to psychological measure- 
ments. 

The notions of convergence between inde- 
pendent measures of the same trait and discrim- 


ination between measures of different traits are 
compared with previously published formula- 
tions, such as construct validity and convergent 
operationalism. Problems in the application of 
this validational process are considered. 


sponse tendencies involved being specific to the 
nontrait attributes of each test. The failure to 
demonstrate convergence may lead to concep- 
tual developments rather than to the abandon- 
ment of a test. 
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32. An Approach to the Study of Communicative Acts 


THEODORE M. NEWCOMB 


This paper points toward the possibility that 
many of those phenomena of social behavior 
which have been somewhat loosely assembled 
under the label of “interaction” can be more 
adequately studied as communicative acts. It 
further points to the possibility that, just as 
the observable forms of certain solids are mac- 
roscopic outcomes of molecular structure, so 
certain observable group properties are prede- 
termined by the conditions and consequences of 
communicative acts. 

The initial assumption is that communication 
among humans performs the essential function 
of enabling two or more individuals to main- 
tain simultaneous orientation toward one an- 
other as communicators and toward objects of 
communication, After presenting a rationale for 
this assumption, we shall attempt to show that 
a set of propositions derived from or consistent 
with it seems to be supported by empirical find- 
ings. 


CO-ORIENTATION AND THE A-B-X SYSTEM 


Every communicative act is viewed as a trans- 
mission of information, consisting of discrimi- 
native stimuli, from a source to a recipient.t 
For present purposes it is assumed that the dis- 
criminative stimuli have a discriminable object 
as referent. Thus in the simplest possible com- 
municative act one person (A) transmits infor- 
mation to another person (B) about something 


* Excerpted from Psychological Review, 1953, 60, 
393-404, with permission of the author and the 
American Psychological Association. 


1 This statement is adapted from G. A. Miller's 
definition: “ ‘information’ is used to refer to the 
occurrence of one out of a set of alternative dis- 
criminative stimuli. A discriminative stimulus. is a 
stimulus that is arbitrarily, symbolically, associated 
with some thing (or state, or event, or property) 
and that enables the stimulated organism to dis- 
criminate this thing from others" (9, p. 41). 


(X). Such an act is symbolized here as AtoBreX. 

The term "orientation" is used as equivalent 
to "attitude" in its more inclusive sense of re- 
ferring to both cathectic and cognitive tenden- 
cies. The phrase "simultaneous orientation" 
(hereinafter abbreviated to "co-orientation") 
itself represents an assumption; namely, that 
A's orientation toward B and toward X are in- 
terdependent. A-B-X is therefore regarded as 
constituting a system. That is, certain definable 
relationships between A and B, between A and 
X, and between B and X are all viewed as in- 
terdependent. For some purposes the system 
may be regarded as a phenomenal one within 
the life space of A or B, for other purposes as 
an "objective" system including all of the pos- 
sible relationships as inferred from observations 
of A's and B's behavior. It is presumed that a 
given state of the system exists when a given 
instance of AtoBreX occurs, and that as a result 
of this occurrence the system undergoes some 
change (even though the change be regarded as 
only a reinforcement of the pre-existing state). 

The minimal components of the A-B-X sys- 
tem, as schematically illustrated in Figure 1, are 
as follows: 


1. A's orientation toward X, including both 
attitude toward X as an object to be ap- 
proached or avoided (characterized by sign and 
intensity) and cognitive attributes (beliefs and 
cognitive structuring). 

2. A's orientations toward B, in exactly the 
same sense. (For purposes of avoiding confus- 
ing terms, we shall speak of positive and nega- 
tive attraction toward A or B as persons, and 

y E 
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Figure 1. Schematic illustration of the minimal 
A-B-X. system. 
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of favorable and unfavorable attitudes toward 
X) 

3. B's orientation toward X. 

4. B's orientation toward A. 


In order to examine the possible relation- 
ships of similarity and difference between A 
and B, we shall make use of simple dichotomies 
in regard to these four relationships. That is, 
with respect to a given X at a given time, A 
and B will be regarded as cathectically alike 
(++ or — —) or different (+ — or — +) in 
attitude and in attraction; and as cognitively 
alike or different. We shall also make use of 
simple dichotomies of degree—i.e., more alike, 
less alike. We shall refer to lateral similarities 
of A's and B's orientations to X as symmetrical 
relationships. 

This very simple system is designed to fit 
two-person communication. In the following 
discussion these additional limitations will be 
imposed, for simplicity sake: (a) communi- 
cative acts will be treated as verbal ones, in 
face-to-face situations; (b) initiation of the com- 
municative act is considered to be intentional 
(i.e. such acts are excluded as those which the 
actor assumes to be unobserved); (c) it is as- 
sumed that the "message" is received—i.e., that 
the communicative act is attended. to by an 
intended recipient, though not necessarily with 
any particular degree of accuracy; and (d) A 
and B are assumed to be group members, char- 
acterized by continued association. 

The assumption that co-orientation is essen- 
tial to human life is based upon two consider- 
ations of complementary nature. First, the 
orientation of any A toward any B (assuming 
that they are capable of verbal communication) 
is rarely, if ever, made in an environmental 
vacuum. Even in what seems the maximally 
"pure" case of two lovers oblivious to all but 
each other, both singly and both jointly are 
dependent upon a common environment; and 
their continued attachment is notoriously con- 
tingent upon the discovery or development of 
common interests beyond themselves. It is not 
certain that even their most person-oriented 
communications (e.g., "I love you") are devoid 
of environmental reference. The more intense 
one person's concern for another the more 
sensitive he is likely to be to the other's orien- 
tations to objects in the environment. 

Second, the orientation of any A capable of 
verbal communication about almost any con- 
ceivable X. is rarely, if ever, made in a social 


vacuum. There are few if any objects so pri- 
vate that one’s orientations toward them are 
uninfluenced by others’ orientations. This is 
particularly true with regard to what has been 
termed "social reality" (3); ie. the less the 
possibility of testing one’s assumptions by 
observing the physical consequences of those 
assumptions, the greater the reliance upon so- 
cial confirmation as the test of what is true 


and valid. And even when assumptions can be ` 


put to the direct test (e.g., the child can find 
out for himself about the stove which he has 
been told is hot), social reality is often accepted 
as the quicker or the safer test. As various 
linguists have pointed out, moreover, a good 
deal of social reality is built into the very lan- 
guage with which we communicate about 
things. Under the conditions of continued as- 
sociation which we are assuming, A and B 
as they communicate about X are dependent 
upon each other, not only because the other's 
eyes and ears provide an additional source of 
information about X, but also because the 
other's judgment provides a testing ground for 
social reality. And to be dependent upon the 
other, in so far as such dependence influences 
behavior, is to be oriented toward him. 

In short, it is an almost constant human 
necessity to orient oneself toward objects in 
the environment and also toward other persons 
oriented toward those same objects. To the 
degree that A's orientation either toward X or 
toward B is contingent upon B's orientation 
toward X, A is motivated to influence and/or 
to inform himself about B's orientation toward 
X. Communication is the most common and 
usually the most effective means by which he 
does so. 


SYMMETRY OF ORIENTATION 


Much of the remainder of this paper will 
deal with the relationships between A's and 
B's orientations toward X, within the postu- 
lated A-B-X system. The implications of this 
model are: (a) that while at any given moment 
the system may be conceived of as being "at 
rest," it is characterized not by the absence but 
by the balance of forces; and (b) that a change 
in any part of the system (any of the four 
relationships portrayed in Figure 1) may lead 
to changes in any of the others. We shall also 
make the assumption (not inherent in the 
model) that certain forces impinging upon the 
system are relatively strong and persistent, and 
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that thus there are “strains” toward preferred 
states of equilibrium. 

This assumption, related to the initial one 
concerning the co-orientation function of com- 
munication, is as follows. To the degree that 
A's orientation toward X is contingent upon 
B's orientation toward X, A's co-orientation 
will be facilitated by similarity of his own and 
B's orientation toward X. The first advantage 
of symmetry—particularly of cognitive sym- 
metry—is that of ready calculability of the 
other's behavior; the more similar A's and B's 
cognitive orientations, the less the necessity for 
either of them to "translate" X in terms of 
the other's orientations, the less the likelihood 
of failure or error in such "translations," and 
thus the less difficult and/or the less erroneous 
the co-orientation of either. Second, there is 
the advantage of validation of one's own orien- 
tation toward X; the more similar A's and B's 
orientations, either cognitive or cathectic (par- 
ticularly in the many areas where validation 
is heavily dependent upon "social reality"), 
the more confident each of them can be of his 
own cognitive and evaluative orientations. Co- 
orientation is of course possible with little or 
no symmetry, but the facilitative value of sym- 
metry for co-orientation is considerable. 

If these advantages are commonly experi- 
enced as such, communicative acts resulting in 
increased symmetry are likely to be rewarded, 
and symmetry is likely to acquire secondary 
reward value. This is the basis of our assump- 
tion of a persistent “strain toward symmetry,” 
under the conditions noted. 

These assumptions may now be brought 
together in terms of the following inclusive 
postulate: The stronger the forces toward A’s 
co-orientation in respect to B and X, (a) the 
greater A’s strain toward symmetry with B in 
respect to X; and (b) the greater the likelihood 
of increased symmetry as a consequence of one 
or more communicative acts. The latter part of 
the postulate assumes the possibility of modified 
orientations toward X on the part of both A 
and B, who over a period of time exchange 
roles as transmitters and receivers of informa- 
tion. 

Several testable propositions are derivable 
from this postulate. First, if the likelihood of 
instigation to and achievement of symmetry 
varies as a function of forces toward co-orien- 
tation, the latter varies, presumably, with 
valence of the objects of co-orientation—i.e., 
of intensity of attitude toward X and of attrac- 


tion toward B. That is, under conditions such 
that orientation toward either B or X also 
demands orientation toward the other, the 
greater the valence of B or of X the greater the 
induced force toward co-orientation, and thus 
the greater the likelihood of both instigation 
toward and achievement of symmetry. 

Such research findings as are known to the 
writer are in support of these predictions. 
Experimental results reported by Festinger and 
"Thibaut (5), by Schachter (12), and by Back (1) 
indicate that attempts to influence another 
toward one's own point of view vary as a func- 
tion of attraction. In the second of these 
studies it is shown that communications within 
a cohesive group are directed most frequently 
toward those perceived as deviates, up to a 
point where the deviate is sociometrically re- 
jected (ie, attraction decreases or becomes 
negative), beyond which point communication 
to them becomes less frequent. It is also shown 
in this study that frequency of influence- 
attempting communication varies with degree 
of interest in the topic of group discussion. 

Some of these same studies, and some others, 
present data concerning symmetry as a conse- 
quence of communication. Thus Festinger and 
Thibaut, varying "pressure toward uniformity" 
and "perception of homogeneous group com- 
position," found actual change toward uni- 
formity following a discussion to be a function 
of both these variables, but some change toward 
uniformity took place in every group, under all 
conditions. Back found that subjects who 
started with different interpretations of the 
same material and who were given an oppor- 
tunity to discuss the matter were influenced 
by each other as a direct function of attraction. 

Findings from two community studies may 
also be cited, as consistent with these labora- 
tory studies. Newcomb (10), in a replicated 
study of friendship choices as related to politi- 
cal attitudes in a small college community, 
found on both occasions that students at each 
extreme of the attitude continuum tended to 
have as friends those like themselves in attitude. 
Festinger, Schachter, and Back (4) in their 
study of a housing project, found a correlation 
of +.72 between a measure of attraction and 
a measure of "conformity in attitude." No 
direct observations of communication are made 
in these two studies; the relevance of their find- 
ings for the present point depends upon the 
assumption that frequency of communication 
is a function of attraction. This assumption 
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is clearly justified in these two particular inves- 
tigations, since in both communities there was 
complete freedom of association. As noted 
below, this assumption is not justified in all 
situations. 

Other testable propositions derivable from 
the general postulate have to do with A's judg- 
ments of existing symmetry between himself 
and B with respect to X. Such judgments (to 
which the writer has for some time applied the 
term “perceived consensus”) are represented by 
the symbol B-X, within A’s phenomenal A-B-X 
system. Such a judgment, under given condi- 
tions of demand for co-orientation with re- 
spect to a given B and a given X, is a major 
determinant of the likelihood of a given 
AtoBreX, since strain toward symmetry is in- 
fluenced by perception of existing symmetry. 
Such a judgment, moreover, is either confirmed 
or modified by whatever response B makes to 
AtoBreX. The continuity of an A-B-X system 
thus depends upon perceived consensus, which 
may be viewed either as an independent or 
as a dependent variable. 

According to the previous proposition, the 
likelihood of increased symmetry (objectively 
observed) as a consequence of communicative 
acts increases with attraction and with inten- 
sity of attitude. The likelihood of perceived 
symmetry presumably increases with the same 
variables. Judgments of symmetry, like other 
judgments, are influenced both by “reality” 
and by “autistic” factors, both of which tend, 
as a function of attraction and intensity of 
attitude, to increase the likelihood of perceived 
consensus. Frequency of communication with 
B about X is the most important of the 
“reality” factors, and this, as we have seen, 
tends to vary with valence toward B and 
toward X. As for the “autistic” factors, the 
greater the positive attraction toward B and 
the more intense the attitude toward X, the 
greater the likelihood of cognitive distortion 
toward symmetry. Hypothetically, then, per- 
ceived symmetry with regard to X varies as a 
function of intensity of attitude toward X and 
of attraction toward B. 

A considerable number of research studies, 
published and unpublished, are known to the 
writer in which subjects’ own attitudes are 
related to their estimates of majority or modal 
position of specified groups. Only a minority 
of the studies systematically relate these judg- 
ments to attraction, and still fewer to intensity 
of attitude. Among this minority, however, the 


writer knows of no exceptions to the above 
proposition. The most striking of the known 
findings were obtained from students in several 
university classes in April of 1951, in a ques 
tionnaire dealing with the very recent dismissal 
of General MacArthur by President Truman 
(Table 1). 


TABLE 1 
Pro Anti 
Truman Truman 
Ss Ss 
Who... Who... 


Attribute to “most of my closest friends” 
Pro-Truman attitudes 48 


Anti-Truman attitudes 0 $4 
Neither 4 4 
Attribute to “most uninformed people” 
Pro-Truman attitudes 6 13 
Anti-Truman attitudes 32 14 
Neither 14 13 


If we assume that “closest friends” are more 
attractive to university students than “unin- 
formed people,” these data provide support for 
the attraction hypothesis. Comparisons of those 
whose own attitudes are more and less intense 
also provide support, though less strikingly, for 
the hypothesis concerning attitude intensity. 
Perceived symmetry, viewed as an indepen- 
dent variable, is obviously a determinant of 
instigation to symmetry-directed communica- 
tion. Festinger (3), with specific reference to 
groups characterized by “pressures toward uni- 
formity,” hypothesizes that “pressure on mem- 
bers to communicate to others in the group 
concerning item x increases monotonically 
with increase in the perceived discrepancy in 
opinion concerning item x among members of 
the group,” as well as with “relevance of item 
x to the functioning of the group,” and with 
“cohesiveness of the group.” And, with refer- 
ence to the choice of recipient for communi- 
cations, “The force to communicate about 
item x to a particular member of the group 
will increase as the discrepancy in opinion 
between that member and the communicator 
increases [and] will decrease to the extent that 
he is perceived as not a member of the group 
or to the extent that he is not wanted as à 
member of the group” (8, p. 8). Support for 
all of these hypotheses is to be found in one 
or more of his and his associates’ studies. They 
are consistent with the following proposition: 
the likelihood of a symmetry-directed AtoBreX 
varies as a multiple function of perceived dis- 
crepancy (ie, inversely with perceived sym- 
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metry), with valence toward B and with 
valence toward X. 

Common sense and selected observations 
from everyday behavior may also be adduced 
in support of these propositions. For example, 
A observes that an attractive B differs with 
him on an important issue and seeks symmetry 
by trying to persuade B to his own point of 
view; or A seeks to reassure himself that B 
does not disagree with him; or A gives infor- 
mation to B about X or asks B for information 
about X. From all these acts we may infer 
perception of asymmetry and direction of com- 
munication toward symmetry. Selected obser- 
vations concerning symmetry as a consequence 
of communication are equally plentiful; there 
is, in fact, no social phenomenon which can 
be more commonly observed than the tendency 
for freely communicating persons to resemble 
one another in orientation toward objects of 
common concern. The very nature of the com- 
municative act as a transmission of information 
would, on a priori grounds alone, lead to the 
prediction of increased symmetry, since follow- 
ing the communication both A and B possess 
the information which was only A's before. 
B will not necessarily accept or believe all 
information transmitted by A, of course, but 
the likelihood of his doing so presumably varies 
not only with attraction toward A but also 
with intensity of attitude toward X, since in 
the long run the more important X is to him 
the more likely it is that he will avoid commu- 
nicating with B about X if he cannot believe 
him. Thus the propositions have a considerable 
degree of face validity. 

But everyday observation also provides in- 
stances to the contrary. Not all communications 
are directed toward symmetry, nor is symmetry 
an inevitable consequence of communication, 
even when attraction is strong and attitudes 
are intense, A devoted husband may refrain 
from discussing important business matters 
with his wife, or two close friends may “agree 
to disagree” in silence about matters of im- 
portance to both. People who are attracted 
toward one another often continue to com- 
municate about subjects on which they con- 
tinue to disagree—and this is particularly apt 
to happen with regard to attitudes which ‘are 
intense, contrary to our theoretical prediction. 

In sum, the available research findings and 
a considerable body of everyday observation 
support our predictions that instigation to- 
ward, perception of, and actual achievement 


of symmetry vary with intensity of attitude 
toward X and attraction toward B. The readi- 
ness with which exceptions can be adduced, 
however, indicates that these are not the only 
variables involved. The propositions, at best, 
rest upon the assumption of ceteris paribus; 
they cannot account for the fact that the prob- 
abilities of A's instigation to communicate 
about a given X are not the same for all 
potential B’s of equal attraction for him, nor 
the fact that his instigation to communicate 
to a given B are not the same for all X's of 
equal valence to him. We shall therefore 
attempt to derive certain further propositions 
from our basic assumption that both instiga- 
tion to and achievement of symmetry vary with 
strength of forces toward co-orientation in the 
given situation. 


DYNAMICS OF CO-ORIENTATION 


The foregoing propositions represent only a 
slight extrapolation of Heider's general prin- 
ciple (6) of "balanced states" in the absence 
of which "unit relations will be changed 
through action or through cognitive reorgani- 
zation.” In a later paper devoted specifically 
to the implications of Heider's hypotheses for 
interrelationships among attitudes toward a 
person and toward his acts, Horowitz et al. (8) 
note the following possible resolutions to states 
of imbalance: (a) the sign-valence of the act is 
changed to agree with that of the actor; (b) the 
reverse of this; and (c) the act is cognitively 
divorced from the actor; in addition, of course, 
the disharmony may be tolerated. 

Orientations as attributed by A to B are 
here considered as equivalent to acts so attrib- 
uted, in Heider's sense, and symmetry is taken 
as a special case of balance. Assume, for 
example, the following asymmetry in A's 
phenomenal system: +A:X, +A:B, —B:X, 
+B:A (i.e, A has positive attitude toward X, 
positive attraction toward B, perceives B's at- 
titude toward X as negative, and B's attraction 
toward A as positive). Any of the following at- 
tempts at "resolution," analogous to those men- 
tioned by Heider, are possible: (a) —A:X; (b) 
—A:B; or (c) cognitive dissociation. These can 
occur in the absence of any communication 
with B. Attempts at harmony (symmetry) may 
also be made via communications directed to- 
ward +B:X. And, if such attempts fail, the 
three alternatives mentioned as possible without 
communication are still available. Finally, there 
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is the possibility of compromise, following com- 
munication (e.g., agreement on some midpoint), 
and the possibility of “agreeing to disagree.” 

Such acts of resolution are made necessary, 
according to the present theory, by the situa- 
tional demands of co-orientation on the one 
hand and by the psychological strain toward 
symmetry on the other. But symmetry is only 
a facilitating condition for co-orientation, not 
a necessary one. While (as maintained in the 
preceding propositions) the probabilities of 
symmetry vary, ceteris paribus, with demand for 
co-orientation, the theory does not demand that 
a symmetry-directed AtoBreX occur in every 
instance of strong demand for co-orientation. 
On the contrary, the theory demands that it 
occur only if, as, and when co-orientation is 
facilitated thereby. We must therefore inquire 
more closely into the nature of the forces to- 
ward co-orientation as related to possible forces 
against symmetry. 

One kind of situational variable has to do 
with the nature of the forces which result in 
association between A and B. Of particular im- 
portance are constrained (enforced) vs. volun- 
tary association, and association based upon 
broad as contrasted with narrow common in- 
terests. The range of X's with regard to which 
there is demand for co-orientation is presum- 
ably influenced by such forces. The relevant 
generalization seems to be as follows: The less 
the attraction between A and B, the more nearly 
strain toward symmetry is limited to those par- 
ticular X’s co-orientation toward which is re- 
quired by the conditions of association. This 
would mean, for example, that as attraction 
between two spouses decreases, strain toward 
symmetry would increasingly narrow to such 
X's as are required by personal comfort and 
conformity with external propriety; similarly, 
the range of X's with regard to which there is 
strain toward symmetry is greater for two 
friendly than for two hostile members of a 
chess club. 

The problem of constraint has already been 
noted. In some of the studies cited above it was 
assumed that frequency of communication var- 
ies with attraction, but this is not necessarily 
true under conditions of forced association. 
Two recent theoretical treatises deal with this 
problem. 

Homans, one of whose group variables is 
“frequency of interaction” (though not commu- 
nication, specifically), includes the following 
among his other propositions: “If the frequency 


of interaction between two or more persons 
increases, the degree of their liking for one an- 
other will increase, and vice versa"; and “The 
more frequently persons interact with one an- 
other, the more alike in some respects both their 
activities and their sentiments tend to become” 
(7, p. 120). (The latter proposition, which 
closely resembles the one here under considera- 
tion, apparently takes a much less important 
place in Homans’ system than the former.) Al- 
most immediately, however, the latter propo- 
sition is qualified by the statement, “It is only 
when people interact as social equals and their 
jobs are not sharply differentiated that our hy- 
pothesis comes fully into its own." In nearly 
every chapter, moreover, Homans (whose prop- 
ositions are drawn post hoc from various com- 
munity, industrial, and ethnological studies) 
points to the limitations which are imposed by 
constraining forces—particularly those of rank 
and hierarchy—upon the relations among at- 
traction, similarity of attitude, and communi- 
cation. 

Blake manages to incorporate these consider- 
ations in a more rigorous proposition. Noting 
that hostility cannot be considered as the simple 
psychological opposite of positive attraction, he 
proposes to substitute a curvilinear for Homans’ 
linear hypothesis: “. . . when pressures operate 
to keep members of a group together, the 
stresses that drive toward interaction will be 
stronger in both positive and negative feeling 
states than in neutral ones” (2). This proposi- 
tion seems consistent with the present argument 
to the effect that demands for co-orientation are 
likely to vary with the nature and degree of 
constraints upon association; hence communi- 
cative acts, together with their consequences, 
will also vary with such constraints. 

Another situational variable deals with the 
fact that, under conditions of prescribed role 
differentiation, symmetry may take the form of 
“complementarity” (cf. 11) rather than same- 
ness. For example, both a man and his small 
son may (following a certain amount of com- 
munication of a certain nature) subscribe to the 
same norms which prescribe differentiated be- 
havior for man and boy with respect to à 
whiskey and soda. If the father drinks in the 
son's presence, there are demands upon both 
of them for co-orientation; but there is strain 
toward symmetry only with respect to "the 
code," and not with respect to personal orien- 
tation toward the whiskey and soda. The code 
becomes the X with regard to which there is 
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strain toward symmetry. In more general terms, 
under conditions of differentiation of A’s and 
B’s role prescriptions with regard to X, the 
greater the demand for co-orientation the 
greater the likelihood of strain toward sym- 
metry with respect to the role system (rather 
than with respect to X itself). 

A third situational variable has to do with 
the possibility that symmetry may be threaten- 
ing. Particularly under circumstances of shame, 
guilt, or fear of punishment there are apt to 
be strong forces against a symmetry-directed 
AtoBreX, even though—in fact, especially when 
—attitude toward X (the guilty act) and attrac- 
tion toward B (a person from whom it is to be 
concealed) are strong. Under these conditions 
it is the demand for co-orientation which cre- 
ates the problem; if A could utterly divorce X 
(his own act) from B, he would not feel guilty. 
Forces toward symmetry, however, are opposed 
by counterforces. Demand for co-orientation in- 
duces strain toward symmetry, but does not nec- 
essarily lead to a symmetry-directed AtoBreX. 

A theoretically analogous situation may re- 
sult from the omnipresent fact of multiple 
membership groups. That is, strains. toward 
symmetry with B, in regard to X may be out- 
weighed by strains toward symmetry with Bs, 
whose orientations toward X are viewed as con- 
tradictory with those of B,. This is often the 
case when, for example, two good friends "agree 
to disagree" about something of importance to 
both. Thus in one study (13) it was found that 
those members least influenced by reported in- 
formation concerning their own group norms 
were those most attracted to groups whose 
norms were perceived as highly divergent from 
those of the group in question. 

Communicative acts, like others, are thus sub- 
ject to inhibition. Such "resolutions" as "agree- 
ment to disagree," however, represent relatively 
stressful states of equilibrium. It is therefore to 
be expected, in ways analogous to those noted 
by Lewin in his discussion of the quasi-station- 
ary equilibrium, that A-B-X systems character- 
ized by such stress will be particularly susceptible 
to change. Such change need not necessarily 
occur in the particular region of the system 
characterized by maximal strain. 

The dynamics of such a system are by no 
means limited to those of strains toward sym- 
metry, but must include changes resulting from 
acceptance of existing asymmetry. The possible 
range of dynamic changes is illustrated in Fig- 
ure 2, (In this figure, the A and B at either 


Figure 2. 


side represent persons as communicators; the 
A and B in the center represent the same per- 
sons as objects of co-orientation. The broken 
lines represent A's judgments of B's orienta- 
tions.) Given perceived asymmetry with regard 
to X, and demand for co-orientation toward B 
and X, the possibilities for A are such that he 
can do the following: 


l. Achieve, or attempt to achieve, symmetry 
with regard to X: 
(a) By influencing B toward own orienta- 
tion. 
(b) By changing own orientation toward 
B's. 
(c) By cognitively distorting B's orienta- 
tion. 
2. Introduce changes in other parts of the 
system: 
(a) Modify his attraction toward B. 
(b) Modify his judgment of own attrac- 
tion for B. 
(c) Modify evaluation of (attraction to- 
ward) himself (A). 
(d) Modify his judgment of B's evaluation 
of himself (B). 
3. Tolerate the asymmetry, without change. 


As suggested by this listing of possible "solu- 
tions," the perception of asymmetry, under con- 
ditions of demand for co-orientation, confronts 
A with a problem which he can attempt to 
solve behaviorally (i.e, by communicative acts) 
and/or cognitively (i.e., by changing either his 
own orientations or his perception of B's orien- 
tations). Whatever his chosen "solution," it has 
some effect upon A's phenomenal A-B-X system 
—either to reinforce it or to modify it. As a 
result of repeatedly facing and "solving" prob- 
lems of co-orientation with regard to a given 
B and a given X, a relatively stable equilibrium 
is established. If A is free either to continue or 
not to continue his association with B, one or 
the other of two eventual outcomes is likely: 
(a) he achieves an equilibrium characterized by 
relatively great attraction toward B and by 
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relatively high perceived symmetry, and the as- 
sociation is continued; or (b) he achieves an 
equilibrium characterized by relatively little 
attraction toward B and by relatively low per- 
ceived symmetry, and the association is dis- 
continued. This "either-or" assumption under 
conditions of low constraint presupposes a cir- 
cular relationship between attraction and the 
perception of symmetry. The present theory 
demands this assumption of circularity, and em- 
pirical evidence (under conditions of relative 
freedom from constraint) seems to support it. 

Under conditions of little or no freedom to 
discontinue association, no such circularity is 
assumed. The conditions which dictate con- 
tinued association also dictate the requirements 
for co-orientation, which are independent of 
attraction. The empirical data suggest that the 
degree to which attraction is independent of 
symmetry varies with the degree of perceived 
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33. The Principle of Gongruity in the Prediction of 


Attitude Change 


CHARLES E. OSGOOD and PERCY H. TANNENBAUM 


The theoretical model presented in this paper, 
while not pretending to take account of all vari- 
ables relating to attitude change, does attempt 
to cover those variables believed to be most 
significant with respect to the direction of 
change to be expected in any given situation. 
These variables are (a) existing attitude toward 
the source of a message, (b) existing attitude 
toward the concept evaluated by the source, 
and (c) the nature of the evaluating assertion 
which relates source and concept in the mes- 
sage. Predictions generated by the theory about 
the directions and relative amounts of attitude 
change apply to both sources and the concepts 
they evaluate. 


UNDERLYING NOTIONS 


Our work on attitude theory and measure- 
ment is an outgrowth of continuing research 
on experimental semantics, particularly the 
development of objective methods for measur- 
ing meaning (4, 5). From this viewpoint, the 
meaning of a concept is its location in a space 
defined by some number of factors or dimen- 
sions, and attitude toward a concept is its pro- 
jection onto one of these dimensions defined 
as “evaluative.” In the factor analytic work 
we have done so far, the first and most heavily 
loaded factor is always one clearly identifiable 
as evaluative by the labels of the scales it rep- 
resents, eg, good-bad, fair-unfair, valuable- 
worthless, pleasant-unpleasant, and the like. 
This conception of attitude as a dimension or 
factor in total meaning has a number of im- 
plications, including those explored in the 
present paper. It implies, for example, that 


* Reprinted from Psychological Review, 1955, 62, 
42-55, with permission of the senior author and 
the American Psychological Association. 


people having the same attitude toward a con- 
cept, such as NEGRO, may be sharply differen- 
tiated in terms of other dimensions of the 
semantic space (e.g., some perceiving NEGRO as 
powerful and active, others as weak and pas- 
sive).1 

Attitudes toward the various objects of judg- 
ment associated in messages must be measured 
in the same units if comparative statements 
about attitude change are to be made. There 
have been attempts to devise generalized atti- 
tude scales in the history of this field (cf. 6, 7), 
but if one is to judge the criterion of accep- 
tance and use, they have not been outstand- 
ingly successful. In applying the semantic 
differential (a label that has come to be ap- 
plied to our measuring instrument), various 
objects of judgment, sources and concepts, are 
rated against a standard set of descriptive scales. 
To the extent that location on the evaluative 
dimension of the semantic differential is a reli- 
able and valid index of attitude (as determined 
by correlation with other criteria), it is then 
necessarily a generalized attitude scale. We 
have some evidence for validity? and more is 
being obtained; reliability of the differential, 
particularly the evaluative dimension, is rea- 
sonably high, running in the .80's and .90's 
in available data. 

Another underlying notion about human 
thinking we have been exploring is that judg- 
mental frames of reference tend toward max- 


1 A study in progress exhibits precisely this phe- 
nomenon with respect to the concept NEGRO, Sim- 
ilar findings are evident with respect to THE CHURCH 
and CAPITAL PUNISHMENT, 

2 For example, the correlations between scores on 
the evaluative scales of the semantic differential 
and scores on the Thurstone scales on attitude to- 
ward THE CHURCH, NEGRO, and CAPITAL PUNISHMENT 
are .74, 82, and 81, respectively. 
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imal simplicity. Since extreme, “all-or-nothing” 
judgments are simpler than finely discriminated 
judgments of degree, this implies a continuing 
pressure toward polarization along the evalu- 
ative dimension (ie, movement of concepts 
toward either entirely good or entirely bad 
allocations). We have evidence that extreme 
judgments have shorter latencies than more 
discriminative judgments (5), and that extreme 
judgments are characteristic of less intelligent, 
less mature, less well educated, or more emo- 
tionally oriented individuals (8). Furthermore, 
since assumption of identity is a simpler pro- 
cess than maintenance of distinction, this also 
implies a continuing pressure toward elimina- 
tion of differences among concepts which are 
localized in the same direction of the evaluative 
framework. We have evidence that in the judg- 
ing of emotionally polarized concepts all scales 
of judgment tend to rotate toward the evalu- 
ative, e.g., their correlations with good-bad tend 
to increase and therefore the relative loading 
on the evaluative factor tends to increase (5). 

The most “simple-minded” evaluative frame 
of reference is therefore one in which a tight 
cluster of highly polarized and undifferentiated 
good concepts is diametrically opposed in 
meaning to an equally tight and polarized 
cluster of undifferentiated bad concepts. The 
same underlying pressure toward simplicity 
operates on any new or neutral concept to 
shift it one way or the other. For example, 
there is the tendency in American thinking, 
about which Pandit Nehru complains, requir- 
ing that India be either "for us or agin' us." 
This is, of course, the condition referred to by 
the general semanticists (e.g, Johnson, 2) as 
a "two-valued orientation," and it is unfortu- 
nately characteristic of lay thinking in any 
period of conflict and emotional stress. The 
more sophisticated thinker, according to this 
view, should show less tendency to polarize, 
more differentiation among concepts, and thus 


greater relative use of factors other than the 
evaluative. 


THE PRINCIPLE OF CONGRUITY 


The principle of congruity in human think- 
ing can be stated quite succinctly: changes in 
evaluation are always in the direction of in- 
creased congruity with the existing frame of 
reference. To make any use of this principle 
in specific situations, however, it is necessary 
to elaborate along the following lines: When 


does the issue of congruity arise? What direc- 
tions of attitude change are congruent? How 
much stress is generated by incongruity and 
how is it distributed among the objects of 
judgment? 


The Issue of Congruity 


Each individual has potential attitudes to- 
ward a near infinity of objects. It is possible 
to have varying attitudes toward diverse con- 
cepts without any felt incongruity or any pres- 
sure toward attitude change, as long as no 
association among these objects of judgment is 
made. As anthropologists well know, members 
of a culture may entertain logically incom- 
patible attitudes toward objects in their cul- 
ture (e.g, ancestor worship and fear of the 
dead) without any stress, as long as the incom- 
patibles are not brought into association. The 
issue of congruity arises whenever a message 
is received which relates two or more objects 
of judgment via an assertion. 


The simplest assertion is merely a descriptive 
statement: "Chinese cooking is good,” "Jeffer- 
son was right," "This neurotic modern art." To 
the extent that the evaluative location of a 
particular qualifier differs from that of the 
thing qualified, there is generated some pressure 
toward congruity. Similar pressure is generated 
by ordinary statements of classification: "Sena- 
tor McCarthy is a Catholic,” “Tom is an ex- 
con," "Cigarettes contain nicotine." To the 
extent that the evaluative locations of instance 
and class are different, some pressure toward 
congruity exists. A more complex situation is 
that in which a source makes an assertion about 
a concept: "University President Bans Research 
on Krebiozen"; "Communists like strong labor 
unions." This is the most commonly studied 
situation, and one for which we have some 
empirical data against which to test our hy- 
potheses. Assertions may be explicit linguistic 
statements of evaluation or implicit behavioral, 
situational statements. A newsphoto of Mrs. 
Roosevelt smiling and shaking hands with a 
little colored boy is just as effective in setting 
up pressures toward congruity as a verbal state- 
ment on her part. 


Directions of Congruence and Incongruence 


To predict the direction of attitude change 
from this general principle it is necessary tO 
take into account simultaneuosly the existing 
attitudes toward each of the objects of judg: 
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ment prior to reception of the message and 
the nature of the assertion which is embodied 
in the message. Attitudes can be specified as 
favorable (+), neutral (0), and unfavorable (—). 
Assertions can be specified as positive or asso- 
ciative (+) or negative or disassociative (—). 
They may also, of course, include evaluative 
loading (e.g, when X denounces Y, we have 
both a dissociative assertion and negative evalu- 
ation of Y). When attitudes toward both ob- 
jects of judgment are polar, the nature of the 
assertion determines congruence or incongru- 
ence. For EISENHOWER (+) to come out in favor 
of FREEDOM OF THE PRESS (4-) is, of course, con- 
gruent with the existing frame of reference of 
most people in this country, but for THE DAILY 
WORKER (—) to speak in favor of FREEDOM OF 
THE PRESS (+) is attitudinally incongruent. In 
this simplest of states in which human thinking 
operates, sources we like should always sponsor 
ideas we like and denounce ideas we are 
against, and vice versa. 

When the existing attitude toward one of 
the objects of judgment is neutral and the other 
polar, we must speak of what directions would 
be congruent. If, for example, a favorable 
source like EISENHOWER were to make a favor- 
able assertion about the MINISTER FROM SIAM 
(a neutral notion to most of us), it would be 
congruent if the latter were also favorable— 
hence pressure is generated toward attitude 
change in this direction. If prAvpA (—) spon- 
SOrs GRADUAL DISARMAMENT (0), the pressure is 
such as to make the relatively neutral notion 
of disarmament less favorable; similarly, if a 
PROFESSOR (0) as a source favors PREMARITAL 
SEXUAL RELATIONS (—) as making for better 
marriages, it is the PROFESSOR that becomes less 
favorable (this is not unlike the “guilt by asso- 
ciation” technique). Conversely, for our neutral 
PROFESSOR (0) to speak out against MORAL DE- 
PRAVITY (—) must have the effect of raising his 
esteem (this is the familiar “I am against sin” 
technique). 

When both objects of judgment are neutral, 
there is no question of congruity between them, 
and movement is determined solely by the na- 
ture of the assertion, i.e, this becomes a case 
of simple qualification or classification. If MR. 
Jones denounces Mm. sMITH, neither of whom 
is known, there is presumably some negative 
pressure on MR. SMITH by virtue of the sheer 
devaluation of “being denounced.” Since the 
evaluation applies to the concept and not the 
source, the effect should be chiefly upon the 


concept. We shall find evidence for such an 
“assertion effect” in the available data. 

We may now make a general statement gov- 
erning the direction of congruence which will 
hold for any object of judgment, source or 
concept, and any type of assertion. 


1. Whenever one object of judgment is 
associated with another by an assertion, its con- 
gruent position along the evaluative dimension 
is always equal in degree of polarization (d) to 
the other object of judgment and in either the 
same (positive assertion) or opposite (negative 
assertion) evaluative direction. 


This is to say that we have attitude scores 
toward two objects of judgment, OJ, and OJo, 
and to each of these scores we assign a value, 
d, which represents the degree of polarization 
of that attitude. Thus we have do, and do,,. 
Since the measuring instrument which has been 
used in our quantitative work so far (the seman- 
tic differential) treats the evaluative dimension 
as a 7-step scale with “4” defined as the neutral 
point, we have three degrees of polarization in 
each direction, i.e, + 3, + 2, + 1, 0, — 1, — 2, 
—8. Given OJ, and OJ» associated with one 
another through either a positive (0J,A + OJ) 
or negative (07,4 — OJ) assertion, we define 
the congruent position (c) of either object of 
judgment as follows: If OJ,4 + OJ, then 


dcos, = doy, (1) 
dos = dos, (2) 
If OJ,A — OJ, then 
dcoj, = — do, (3) 
deog, = — doy: (4) 


Figure 1 provides some graphic illustrations. 
In example 1, we have a positive assertion 
(indicated by the + on the bar connecting 
source and concept) associating two equally 
favorable objects of judgment; in this situation 
maximum congruity already exists. In all the 
other illusttations given, the existing positions 
are not those of maximum congruence, and 
those positions which would be maximally 
congruent for each object of judgment are 
shown by dashed circles. In situation 3, for 
example, a congruent source would be at —2 
and a congruent concept would be at +3, given 
the favorable assertion between two items of 


opposite sign. 


Figure 1. Graphic examples of four situations in 
which a source (s) makes an assertion (4 or —) 
about a concept (c). Positions of maximum con- 
gruity are indicated by dashed circles; predicted 
changes in attitude are indicated by arrows. See 
discussion in text. 


Magnitude and Distribution of 
Pressure toward Congruity 


Knowing the existing locations of maximum 
congruence under the given conditions (by 
applying Principle 1), it becomes possible to 
state the amount and direction of application 
of total available pressure toward congruity. 


2. The total available pressure toward con 
gruity (P) for a given object of judgment asso- 
ciated with another by an assertion is equal to 
the difference, in attitude scale units, between 
its existing location and its location of maxi- 
mum congruence along the evaluative dimen- 
sion; the sign of this pressure is positive (+) 
when the location of congruence is more favor- 
able than the existing location and negative 
(—) when the location of congruence is less 
favorable than the existing location. 


That is, 
Pos, = dcos, — dos,» (5) 
Pos, = deos, — dozy (6) 


Therefore, substituting from equations 1 
through 4, if OJ,A + OJy 


Pos, = dos, — dos» (7) 
Pos, = dos, — doy, (8) 

If OJ,A — OJ, 
Pos, = — dos, — dos, (9) 


Pos, = — 403, — dosy (10) 


The resulting signs of equations 7 through 10 
then represent the direction of P. 

For example 2 in Figure 1, the total pressure 
toward congruity available for the source is 
—2 units and for the concept is +2 units. As 
can be seen by inspection of these examples, 
the total pressures toward congruity for both 
objects associated by an assertion are al- 
ways equal in magnitude, although they 
may be the same or different in sign (ie. 
|Po;,| = |Poz,]). The upper figures in each cell 
of Table 2 give the total pressures and direc- 
tions of application for all possible relations 
among sources and concepts and both types of 
assertions. These computations are based upon 
the assumption of a 7-step scale with three de- 
grees of polarization possible in each evaluative 
direction; they may be treated as general index 
numbers. 

The third principle with which we shall 
operate incorporates the empirical generaliza- 
tion that intense attitudes are more resistant 
to change than weakly held ones (cf. 1, 8, 9), 
but does so in a way which generates more 
detailed predictions. 


3. In terms of producing attitude change, 
the total pressure toward congruity is distrib- 
uted between the objects of judgment assoct- 
ated by an assertion in inverse proportion to 
their separate degrees of polarization. 


In other words, relatively less polarized 
objects of judgment, when associated with rela- 
tively more polarized objects of judgment, ab- 
sorb proportionately greater amounts of the 
pressure toward congruity, and consequently 
change more. 

Applying Principle 3 above, it is possible to 
predict relative attitude change according to 
the following formulas: 


doy, 
ix odes ai MARRE C: 
[doz | + [dosa] 
Idos, | 


AGo, — ——————— Po, (9 
ws Ido; as |dos,| T 


Coy, 


where AC refers to attitude change; where 


i s 
and doz, are taken at their absolute value 


d, 
ve are 


regardless of sign, and where Pos, and Pos, 
determined from equations 7 through 10. Thus, 
the sign of the right-hand side of the equation 
is always that of the particular P5, under con- 
sideration, and thus represents the direction of 
change. In example 1 in Figure 1, there is no 
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pressure and hence no change. In the other 
examples solid arrows indicate the direction 
and magnitude of predicted change. In ex- 
ample 2, the source must absorb twice as much 
pressure as the more polarized concept, and 
in a negative rather than a positive direction 
(e.g. BULGARIA —l sponsors HATE CAMPAIGN 
—3). The unexpected prediction here, that the 
even more unfavorable concept, HATE CAM- 
PAIGN, actually becomes a little less unfavorable 
under these conditions, derives directly from 
the theoretical model and will be discussed 
later. 

If we associate OJ, with the source of an 
assertion, and OJ, with the concept, the num- 
bers in Table 1 represent the results of apply- 
ing formula 12 to the prediction of attitude 
change toward the concept for all combinations 
of original attitude toward source and concept 
when the assertion is positive. 

Note that for all incongruous relations the 
predicted change is constant for a given original 
attitude toward source (upper right and lower 
left corners of matrix)—a highly favorable 
source favoring a negative concept produces 
just as much attitude change when that con- 
cept is —3 as when it is —1. This prediction 
assumes complete credulity of the message on 
the part of the receiver, a condition that exists 
only rarely, in all probability, for incongruous 
messages. Certainly, when presented with the 
incongruous message, EISENHOWER sponsors 
COMMUNISM, in an experimental situation, very 
few subjects are going to give it full credence. 
If we are going to make predictions, it is ap- 
parent that the variable of credulity must be 
taken into account. 


4. The amount of incredulity produced 
when one object of judgment is associated with 
another by an assertion is a positively accel- 
erated function of the amount of incongruity 
which exists and operates to decrease attitude 
change, completely eliminating change when 
maximal. 


Since incongruity exists only when similarly 
evaluated. concepts are associated by negative 
assertions or when oppositely evaluated con- 
cepts are associated by positive assertions, the 
correction for incredulity is limited to the up- 
per right and lower left corners of the matrix 
in Table 1. Within these situations, the amount 
of this correction is assumed to increase with 
the degree of incongruity, e.g., with the total 
pressure toward congruity available. It is as- 
sumed that no incongruity, and hence no in- 
credulity, can exist where one of the objects 
of judgment is neutral, e.g., EISENHOWER may 
come out either for or against a neutral con- 
cept like ST. LAWRENCE WATERWAY without the 
issue of incredulity arising. It is realized, of 
course, that other factors than those discussed 
here may affect incredulity. 

Figure 2 provides graphic illustration of the 
corrections made for incredulity. The original 
curves, level for the neutral point and beyond, 
derive from the upper right corner of Table 1 
and represent three degrees of favorable origi- 
nal attitude toward the source (OJ,) The 
dashed lines represent postulated incredulity, 
positively accelerated functions of total pres- 
sure toward congruity. The shape of this func- 
tion is, of course, based on pure hunch and 
will probably have to be modified; it simply 


TABLE 1. Predicted Attitude Change for Concept as a Function of 
Original Locations of Both Source and. Concept—Positive Assertion 


(Uncorrected for incredulity) 


Original Attitude Toward Concept 
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LU p XE = 
- O e N CQ 


Attitude change toward OJ 


*3 *2 +1 


0 1 — -3 


Initial attitude toward OJ» 


Figure 2. Predicted change in attitude toward OJ, and correction for incredulity 
(postulated incredulity indicated by dashed lines). 


seems reasonable that a person's tendency to 
reject a message will be relatively much less 
for slightly incongruous relations (e.g., EISEN- 
HOWER +3 praises BULGARIANS —1) than for 
grossly incongruous ones (e.g., EISENHOWER +3 
praises COMMUNISM —3). The light solid curves 


represent the result of subtracting incredulity 
functions from predicted attitude changes. The 
corrected values are given in Table 2. The same 
corrections, with appropriate regard to sign, 
apply to the lower left corner of Table 1 and 
are made in Table 2. For incongruous situa- 


" TABLE 2. Total Pressure Toward Congruity (Upper Numbers) and 
Predicted Changes in Attitude (Lower Numbers) as Corrected 
for Incredulity* 


(Positive assertion)** 


Initial 
Location 
of OJ, 


+1 +2 
+06 +15 

+2 0 +1 
00 +0.7 


any 0 
0.0 


Initial Location of OJ, 


+5 +6 
+1.5 0.0 


+38 +4 +5 
S18 418 +405 |'"+08 
+2 +3 +4 


+0.9 407 +04 


403 405 


* The OJ whose change is being predicted, either source or concept, is 
always OJ, and the other is OJy 
** When dealing with negative assertions, reverse the sign of OJ, and 
look in that row. 
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tions, then, the formulas for predicting attitude 
change become: 


RENE (18) 
m o— Á += 
"odi * 
(dsl . 
AC, ——— PE (14) 
|4,| + Id. 


where the sign of the second factor (i = correc- 
tion for incredulity) is always opposite to that 
of the first factor, i.e., of P, and thus serves to 
diminish the effect. The notations s and c refer, 
of course, to source and concept, respectively. 

It is to be emphasized that formulas 13 and 
14 apply only to situations of incredulity, or, 
to put it another way, i — 0 for all credulous 
situations. The precise nature of the i function 
remains to be empirically determined. The 
curve we have assumed for i approximates the 
function i—a(d,2--b)(dj--d), where the 
constants a and b are 1/40 and 1 respectively. 
It is clear, however, that i = f(d,,d,), and hence 
the attitude change is still a function of the 
two degrees of polarization and of direction 
of the assertion. 

Incredulity, to the extent it is present, will 
not only operate to "damp" changes in attitude 
but should also appear in expressions of dis- 
belief and rationalization. This makes it pos- 
sible to ascertain the incredulity function 
independent of attitude change. A proposed 
experiment along these lines might be as 
follows: If subjects are presented with a num- 
ber of messages and told that some are valid 
and others faked, we would expect the fre- 
quency of "fake" judgments to be some increas- 
ing function of the measured amount of 
incongruity (e.g, in terms of locations of origi- 
nal attitudes toward the associated objects of 
judgment). It is the shape of this function in 
which we would be particularly interested. If 
the same subjects were then assured that cer- 
tain of the “fake” messages were actually valid, 
we would expect to record rationalizations and 
other attempts to interpret the message with- 
out modifying the evaluative frame of refer- 
ence; e.g., told that RUssIA is actually sponsoring 
a PEACE CONFERENCE, the subject is likely to 
rationalize this event as some subterfuge on 
the part of the Soviets in the Cold War. Use 
of the same messages on another group of 
subjects in a standard pre- and postmessage 
attitude change design should serve to test the 
prediction that attitude change for incongruous 


assertions is damped in proportion to the de- 
gree of incredulity produced. 

This independence of credulity as a variable 
also means that it should be possible to ap- 
proximate the attitude change values in Table 
1 (uncorrected) under special conditions where 
credulity is made more probable. For example, 
if EISENHOWER were actually to invite important 
COMMUNIST PARTY OFFICERS to a friendly dinner 
at the White House, the effects on devaluating 
EISENHOWER and vice versa should be extreme. 
The greater effectiveness of “event” as com- 
pared to “word” propaganda follows directly. 

The lower numbers in each cell in Table 2 
represent the predicted magnitudes and direc- 
tions of attitude change for all combinations 
of original attitude—for sOURCES or CONCEPTS 
and for either positive assertions or negative 
assertions (see instructions in table footnotes) 
—as computed by applying formulas 11 and 
12, or 13 and 14 in the case of incongruous 
assertions. Let us take illustration 2 in Figure 
1 as an example: the original attitudes in this 
case are —1 (source) and —3 (concept), so we 
will be concerned with the cell defined by 
these values. Looking first for SOURCE CHANGE 
(e.g., SOURCE as O7,), we find a total pressure 
toward congruity of —2 (upper figure in cell) 
and a predicted attitude change of —1.5; look- 
ing then for CONCEPT CHANGE (€.g., CONCEPT as 
OJ,), we find a total pressure of +2 and a pre- 
dicted attitude change of +0.5. If this were the 
message BULGARIA praises COMMUNISM, we might 
expect a considerable increase in unfavorable- 
ness toward BULGARIA and a slight decrease in 
unfavorableness toward comMunisM. The reader 
may check the other illustrations against Table 
2 if he wishes. 

Predictions about attitude change are as- 
sumed to hold for any situation in which one 
object of judgment is associated with another 
by an assertion. In the special source-concept 
situation, we must take one additional factor 
into account—the fact that the assertion itself, 
whether positive or negative, typically applies 
to the concept rather than to the source. When 
X praises Y, the favorable effect of "praise" 
applies chiefly to Y; when X denounces x. 
similarly, the unfavorable effect of “denounce” 
applies chiefly to Y. In other words, we must 
add to the equation predicting attitude change 
for concepts a constant (+A) whose sign is 
always the same as that of the assertion. In 
the data we have available, the existence of 
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such an assertion constant applying to the con- 
cept but not the source is clearly evident. 


A TEST OF THE CONGRUITY PRINCIPLE’ 


On the basis of a pretest of 36 potential 
objects of judgment, three source-concept pairs 
were selected which met the criteria of (a) 
approximately equal numbers of subjects hold- 
ing favorable, neutral, and unfavorable original 
attitudes toward them, and (b) lack of corre- 
lation between attitude toward the source and 
the concept making up each pair. The three 
source-concept pairs finally selected were: LABOR 
LEADERS with LEGALIZED GAMBLING, CHICAGO 
TRIBUNE With ABSTRACT ART, and SENATOR ROB- 
ERT TAFT with ACCELERATED COLLEGE PROGRAMS. 
Another group of 405 college students was 
given a before-test, in which the 6 experimental 
concepts along with 4 "filler" concepts were 
judged against a form of the semantic differ- 
ential including 6 scales highly loaded on the 
evaluative factor. The sum of ratings on these 
six 7-step scales constituted the attitude score 
for each concept, these scores ranging from 6 
(most unfavorable) to 42 (most favorable). Five 
weeks later the same subjects were given highly 
realistic newspaper stories including positive 
or negative assertions involving the experi- 
mental source-concept pairs. Immediately after- 
ward the subjects were given the after-test, 
again judging the same concepts against the 
semantic differential. 

Original attitudes toward each source and 
concept were determined from the before-test 
Scores, subjects being distributed into nine cells, 
$404, $4 Cy, S4C, Soc,, etc. Attitude change 
amounts, for both source and concept, were 
obtained by subtracting the before-test score 
from the after-test score for each subject, a 
positive value thereby indicating increased 
favorableness. 

Table 3 compares predicted attitude change 
Scores (upper number in each cell) with ob- 
tained attitude change scores (lower number in 
each cell) for both sources and concepts and 
for both positive and negative assertions. The 
predicted values represent the algebraic mean 
of the attitude change scores in appropriate 
cells of Table 2 (e.g. the value for s, c, with 
a positive assertion equals the average for the 
nine cells in the upper left corner); the ob- 


* This experiment is described in detail in a sepa- 
rate report by one of the authors (9). 


TABLE 3. Predicted (Upper Values in Cells) 
and Obtained (Lower Values in Cells) 
Changes in Attitude 


Positive Assertions Negative Assertions 


Cree Original Attitude Original Attitude 
Toward toward Concept toward Concept 
Source + 0 — T 0 - 
Source Changes 
+ + 0.2 00 —11 —1.1 0.0 4-02 
+2 +16 — 42 —45 + 1 +8 
0 + 2.0 00 —20 —20 0.0 420 
+150 +2 — 94 — 68 + 17 +96 
= + 1.1 00 —02 —0.2 0.0 +11 
+49 +13 — 7 —38 — 8 +H 
Concept Changes 
E +02 +20 +11 —11 —20 —02 
+ 51 +245 +107 — 88 —180 — 39 
0 0.0 0.0 0.0 0.0 0.0 0.0 


+ 39 +80 +48 —72 — 79 — 34 
—20 —02 +02 +20 +11 
— 94 — 52 — 10 +19 + 22 +16 


tained values represent the total attitude 
change scores, summed algebraically, for 45 
subjects (15 subjects on each of 3 stories) on 
6 evaluative scales. The reason for the gross 
difference in absolute magnitudes of predicted 
and obtained scores is therefore that the for- 
mer are expressed in scale units and the latter 
in group totals. The general correspondence 
between predicted and obtained directions of 
attitude change is apparent from inspection 
of Table 3. In every case predicted positive - 
changes (+) and predicted negative changes 
(—) show corresponding signs in the obtained 
data, and predicted lack of change (0) generally 
yields obtained changes of small magnitude. 
The predictions obviously hold better for 
source changes than for concept changes, 
it will be recalled that it was also predicted 
that an assertion constant (+A) would apply 
to the concept but not the source. This would 
mean that for comparable situations (e.g 54.60 
VS. SC, SoC VS. $ 6g, etc.) concept changes 
should be more in the favorable direction than 
source changes for positive assertions and more 
in the negative direction for negative asser 
tions. Table 4 provides a test of this prediction. 
As can be seen, when comparable conditions 
for source and concept changes are arran| 
the differences in magnitudes of attitude 
change are regularly positive for positive asset 
tions (concept changes more toward favorable 
direction) and regularly negative for negative 
assertions (concept changes more toward un- 
favorable direction). With 17 of the 18 values 
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TABLE 4. Effects of Assertion Itself (A) 


Source Concept Difference (A) 
———— 
Positive Assertions 
(Predicted that A is +) 
$4C, + 25 s,c, + 5l 4 26 
Soc, +150 $4€9 + 245 + 95 
sc, + 49 s,c.. +107 + 58 
Sco + 16 Soc} + 39 + 23 
Soco + 25 Soco + 80 + 55 
s + 18 Soc. + 48 + 35 
$,c_— 42 Sic, — 24 + 18 
Soc. — 94 sey — 52 + 42 
s GET S s c — 10 —- 3 
Negative Assertions 
(Predicted that A is —) 
sc} — 45 s c — 88 — 48 
soc} — 68 540g — 180 —112 
s.c, — 33 s.c. — 39 — 6 
$46, + 1 $g€4. — 72 — 75 
Solo + 7 Soco — 79 — 96 
St, — 9 soc. — 34 — 31 
s,c 4 34 s.c, + 19 — 15 
Spt = 106 sao + 22 — 74 
s_c_+ 34 s_c_+ 16 — 18 


in the predicted direction, this is obviously 
significant. 

A rough estimate of the size of this constant 
can be obtained from the average difference 
between source and concept changes for com- 
parable situations: it turns out to be A=+ 46 
in total change score or 4 = .17 in units of 
the 7-step attitude scale employed. In other 
words, under the general conditions of this 
experiment, the assertion constant applied to 
concept changes equals about 4 of a scale unit. 

That the magnitudes of attitude changes as 
well as their directions tend to follow predic- 
tions is also evident by inspection of Table 3. 
This can be seen more clearly in the correlation 
plot between predicted changes and obtained 
changes given as Figure 3. The predicted values 
are here treated as categorical and the obtained 
as continuous, and the latter have been cor- 
rected for the assertion constant by adding 46 
to concept changes with negative assertions and 
subtracting 46 from concept changes with posi- 
tive assertions. The correlation between pre 
dicted and obtained changes in attitude is high 
(r = 91). 

A number of corollaries derive from the con- 
gruity principle, some of which can be tested 


+141-180 


+101-140 


+61-100 


+21-60 


* Source changes 
o Concept changes 


-20 -11-02 00 402 +1.1 420 
Expected 
Figure 3. Correlation plot between predicted and 
obtained attitude changes after corrections for in- 
credulity and for constant effect of assertion. 


against Tannenbaum's data and others of which 
cannot. They are as follows: 


l. Shifts in evaluation always tend toward 
equalization of the degrees of polarization of 
the objects of judgment associated by an asser- 
tion. If two unequally polarized concepts are 
associated, the less polarized one becomes more 
so and the more polarized less so; if a neutral 
concept is associated with a polarized one, it 
always becomes more polarized. In Tannen- 
baum's data this means that the less polarized 
object of judgment (neutral) should always 
change more than the more polarized object 
of judgment (plus or minus); this holds for all 
relevant conditions except one (s9¢,, negative 
assertion), and even here when correction for 
the assertion. constant is made. The compari- 
sons shown in Table 5, where concept changes 
have been corrected for the assertion constant, 
clearly substantiate this prediction—in every 
case the neutral member shows a large shift 
and the polar member a small shift. 

2. When the total pressure toward congruity 
is constant, it is easier to make an object of 
judgment more polarized than less so. That 
this prediction follows from the theory can be 
seen in Table 2 by comparing magnitudes of 
attitude change within the same row or column 
(for a given location of source or concept) 
when the total pressure is the same in amount 
but opposite in sign—amount of attitude 
change is always larger in the same direction 
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TABLE 5. Comparison of Attitude Changes 
for Neutral and Polar Objects of Judgment 


Change Change 
for for 
Situation Neutral Polar 

Positive Assertions 

SoC. +150 (s) — 7 (c) 

$4.69 4-199 (c) +16 (s) 

$..69 — 98 (c) +13 (s) 

Soe — 94 (s) + 2(c) 
Negative Assertions 

SoC — 68 (5) —26 (c) 

$4.69 —134 (c) + 1() 

$69 + 48 (c) — 3(s) 

Soc + 96 (s) +12 (c) 


as the sign of the row or column. This cannot 
be checked in Tannenbaum's data because he 
does not differentiate between degrees of origi- 
nal attitude in the same direction. It would 
be expected, however, from our general notion 
that evaluative frames of reference tend toward 
maximum simplicity. 

3. Attitude change toward an object of judg- 
ment is an inverse function of intensity of 
original attitude toward that object. That 
weakly held attitudes are more susceptible to 
ae is a widely held notion, but it is only 
valid, according to the present theory, for the 
average of all degrees of attitude toward the 
other object with which a given one is associ- 
ated regardless of sign (the absolute means, 
| M |, given at the bottom of Table 2) or for 
maximally polarized attitudes toward the other 
object (+8 and —3 rows in Table 2). The "law" 
definitely does not hold for other degrees of 
attitude toward the second object of judgment, 
as can be seen by inspection of this table. The 
data in Table 6 compare predicted values (ab- 
solute means at bottom of Table 2) with those 
obtained by Tannenbaum, both being expressed 
in attitude scale units in this case. The close 
correspondence in trend is apparent, and the 
obtained trend is statistically significant. The 
difference in absolute magnitude presumably 
represents the limited effect of a single message 
upon attitude change. The theoretical model 
as developed so far takes account of neither 
learning via successive messages nor of inten- 
sity of assertions. 

4. Attitude change for a givem object of 
judgment in the direction of the assertion is 


TABLE 6. Attitude Change Toward an Object 


of Judgment as a Function of Original 
Attitude Toward that Object Itself 


Original Attitude Toward 
Object of Judgment 
+ 0 - 
Predicted 0.6 1.7 0.6 
Obtained 
(source) 0.2 0.5 0.2 
Obtained 
(concept) 0.3 0.7 0.2 


an approximately linear function of the favor- 
ableness of the original attitude toward the 
other object of judgment with which it is 
associated. The more favorable the attitude 
toward a source, the greater the effect of a 
positive assertion on raising attitude toward 
the concept and the greater the effect of a 
negative assertion upon lowering attitude to- 
ward the concept. Strongly unfavorable sources 
have just the opposite effects. The same state- 
ments hold for changes in attitude toward 
sources when original attitudes toward concepts 
are varied. Table 7 compares predicted and 
obtained values. Again, the generally lower 
levels of obtained changes as compared with 
predicted changes are presumably due to the 
limited effects of a single message. Changes in 


TABLE 7. Attitude Change Toward One 
Object of Judgment as a Function of Original 
Attitude Toward the Other Object 

of Judgment 


SSF 
Original Attitude 
toward Other 
Object 


Assertions + 


Positive 


Predicted +08 0.0 —08 
Obtained 

(source) 40.5 0.0 —0.2 
Obtained 

(concept) +03 0.0 —0.3 

Negative 

Predicted —0.8 0.0 +08 
Obtained 

(source) —0.2 0.0 402 
Obtained 

(concept) —0.2 —0.1 4.02 


Ff 
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attitude have been corrected for the assertion 
constant, here as in Table 6. The obtained 
functions are in the direction predicted, and 
their trend is statistically significant. 

5. Whenever a congruent assertion associ- 
ates two differently polarized objects of judg- 
ment, and neither of them is neutral, the more 
polarized object of judgment becomes less so. 
This deduction includes the rather paradoxical 
situations noted earlier in this paper in which, 
for example, a highly favorable source comes 
out in favor of somewhat less favorable con- 
cept and becomes slightly less favorable itself 
in doing so, according to theory. The locations 
in Table 2 where this prediction arises are 
italicized. Here we are forced to predict, for 
example: when EISENHOWER +3 praises GOLFING 
+1, he loses a little prestige while giving a big 
boost to the concept; when EISENHOWER +3 
denounces comic Books —1, he may make the 
latter considerably more unfavorable, but he 
loses a little ground himself in the exchange. 
It is as if a highly favorable source should 
only favor equally good things or be against 
extremely bad things and a highly unsavory 
concept should be only sponsored by equally 
unsavory sources or condemned by highly noble 
sources. 


If such an effect could be demonstrated, it 
would be convincing evidence for the whole 
theory. Tannenbaum's study provides only a par- 
tial test for this phenomenon in that the experi- 
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SUMMARY 


This paper describes a general theory of atti- 
tude change which takes into account original 
attitude toward the source of a message, origi- 
nal attitude toward the concept evaluated by 
the source, and the nature of the evaluative 
assertion, Predicted changes in attitude toward 
both source and concept are based upon the 
combined operation of a principle of con- 
gruity, a principle of susceptibility as a function 
of polarization, and a principle of resistance 
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34. Structural Balance: A Generalization of Heider’s Theory 


DORWIN CARTWRIGHT and FRANK HARARY 


A persistent problem of psychology has been 
how to deal conceptually with patterns of inter- 
dependent properties. This problem has been 
central, of course, in the theoretical treatment 
by Gestalt psychologists of phenomenal or neu- 
ral configurations or fields (12, 13, 15). It has 
also been of concern to social psychologists and 
sociologists who attempt to employ concepts 
referring to social systems (18). 

Heider (19), reflecting the general field-theo- 
retical approach, has considered certain aspects 
of cognitive fields which contain perceived 
people and impersonal objects or events. His 
analysis focuses upon what he calls the P-O-X 
unit of a cognitive field, consisting of P (one 
person), O (another person), and X (an imper- 
sonal entity). Each relation among the parts of 
the unit is conceived as interdependent with 
each other relation. Thus, for example, if P 
has a relation of affection for O and if O is 
seen as responsible for X, then there will be 
a tendency for P to like or approve of X. If 
the nature of X is such that it would “nor- 
mally” be evaluated as bad, the whole P-O-X 
unit is placed in a state of imbalance, and 
pressures will arise to change it toward a state 
of balance. These pressures may work to 
change the relation of affection between P and 
O, the relation of responsibility between O 


and X, or the relation of evaluation between 
P and X. 


The purpose of this paper is to present and 
develop the consequences of a formal definition 
of balance which is consistent with Heider's 
conception and which may be employed in a 
more general treatment of empirical configu- 


* Reprinted from Psychological Review, 1956, 63, 
277-293, with permission of the senior author and 
the American Psychological Association. This paper 
was prepared as part of a project sponsored in the 
Research Center for Group Dynamics by the Rocke- 
feller Foundation, 
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rations. The definition is stated in terms of 
the mathematical theory of linear graphs (8, 
14) and makes use of a distinction between a 
given relation and its opposite relation. Some 
of the ramifications of this definition are then 
examined by means of theorems derivable from 
the definition and from graph theory. 


HEIDER'S CONCEPTION OF BALANCE 


In developing his analysis of balanced cog- 
nitive units, Heider distinguishes between the 
two major types of relations. The first concerns 
attitudes, or the relation of liking or evaluat- 
ing. It is represented symbolically as L when 
positive and as ~L when negative. Thus, PLO 
means P likes, loves, values, or approves O, and 
P~LO means P dislikes, negatively values, or 
disapproves O. The second type of relation 
refers to cognitive unit formation, that is, to 
such specific relations as similarity, possession, 
causality, proximity, or belonging. It is written 
as U or ~U. Thus, according to Heider, PUX 
means that P owns, made, is close to, or is asso- 
ciated with X, and P~UX means that P does 
not own, did not make, or is not associated 
with X. 

A balanced state is then defined in terms of 
certain combinations of these relations. The 
definition is stated separately for two and for 
three entities. 


In the case of two entities, a balanced state 
exists if the relation between them is positive 
(or negative) in all respects, i.e, in regard to 
all meanings of L and U . .. . In the case of 
three entities, a balanced state exists if all three 
relations are positive in all respects, or if two 
are negative and one positive (9, p. 110). 


These are examples of balanced states: P likes 
something he made (PUX, PLX); P likes what 
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his friend likes (PLO, OLX, PLX); P dislikes 
what his friend dislikes (PLO, O~LX, P~LX); 
P likes what his enemy dislikes (P~LO, O~LX, 
PLX); and P's son likes what P likes (PUO, 
PLX, OLX). 

Heider's basic hypothesis asserts that there is 
a tendency for cognitive units to achieve a 
balanced state. Pressures toward balance may 
produce various effects. 


If no balanced state exists, then forces towards 
this state will arise. Either the dynamic charac- 
ters will change, or the unit relations will be 
changed through action or through cognitive 
reorganization. If a change is not possible, the 
state of imbalance will produce tension (9, pp. 
107-109). 


The theory, stated here in sketchy outline, 
has been elaborated by Heider so as to treat 
a fuller richness of cognitive experience than 
would be suggested by our brief description. 
It has been used, too, by a number of others 
as a point of departure for further theoretical 
and empirical work. We shall summarize 
briefly some of the major results of this work. 

Horowitz Lyons, and Perlmutter (10) at- 
'tempted to demonstrate tendencies toward bal- 
ance in an experiment employing members of 
a discussion group as subjects. At the end of 
a discussion period each subject was asked to 
indicate his evaluation of an event (PLX or 
P~LX) which had occurred during the course 
of the discussion. The event selected for evalu- 
ation was one which would be clearly seen as 
having been produced by a single person 
(OUX). The liking relation. between each P 
and O (PLO or P~LO) had been determined 
by a sociometric questionnaire administered 
before the meeting. Would P's evaluation of 
the event be such as to produce a balanced 
P-O-X unit? If so, P's evaluation of O and X 
should be of the same sign. The experimental 
data tend to support the hypothesis that a 
P-O-X unit tends toward a balanced state! 

The social situation of a discussion group 
1 One of the attractive. features of this study is 
that it was conducted in a natural “field” setting, 
thus avoiding the dangers of artificiality. At the 
same time the setting placed certain restrictions on 
the possibility of manipulation and control of the 
variables, The data show a clear tendency for P to 
place a higher evaluation on Xs produced by more 
attractive Os. It is not clearly demonstrated that P 
likes Xs produced by liked Os and dislikes Xs pro- 
duced by disliked Os. 


can be better analyzed, according to Horowitz, 
Lyons, and Perlmutter, by considering a some- 
what more complex cognitive unit. The evalu- 
ation of X made by P, they argue, will be 
determined not only by P's evaluation of O 
but also by his perception of the evaluation of 
X given by others (Qs) in the group. The basic 
unit of such a social situation, then, consists 
of the subject, a person who is responsible for 
the event, and another person who will be seen 
by the subject as supporting or rejecting the 
event. This is called a P-O-Q-X unit. The addi- 
tional data needed to describe these relations 
were obtained from the sociometric question- 
naire which indicated P's evaluation of Q 
(PLQ or P~LQ), and from a question designed 
to reveal P's perception of Q's support or rejec- 
tion of X, treated by the authors as a unit 
relation (QUX or Q-UX).? 

Although these authors indicate the possi- 
bility of treating the P-O-Q-X unit in terms of 
balance, they do not develop a formal defi- 
nition of a balanced configuration consisting 
of four elements. They seem to imply that the 
P-O-Q-X unit will be balanced if the P-O-X 
and the P-Q-X units are both balanced. They 
do not consider the relation between Q and O, 
nor the logically possible components of which 
it could be a part. Their analysis is concerned 
primarily with the two triangles (P-O-X and 
P-Q-X), which are interdependent, since both 
contain the relation of P's liking of X. We 
noted above that the data tend to support the 
hypothesis that the P-O-X unit will tend toward 
balance. The data even more strongly support 
the hypothesis when applied to the P-Q-X unit; 
P's evaluation of X and his perception of Q's 
attitude toward X tend to agree when P likes 
Q, and to disagree when P dislikes Q. It should 
be noted, however, that there was also a clear 
tendency for P to see Q's evaluation of X as 
agreeing with his own whether or not he likes 


In a rather different approach to the ques- 
tion of balanced P-O-X units, Jordan (11) pre- 
sented subjects with 64 different hypothetical 
situations in which the L and U relations 
between each pair of elements was systemat- 
ically varied. The subject was asked to place 
himself in each situation by taking the part of 


2 Whether this relation should be treated as U or 
L is subject to debate. For testing Heider's theory 
of balance, however, the issue is irrelevant, since 
he holds that the two relations are interchangeable 
in defining balance. 
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P, and to indicate on a scale the degree of 
pleasantness or unpleasantness he experienced. 
Unpleasantness was assumed to reflect the pos- 
tulated tension produced by imbalanced units. 
Jordan's data tend to support Heider's hypoth- 
esis that imbalanced units produce a state of 
tension, but he too found that additional fac- 
tors need to be considered. He discovered, for 
example, that negative relations were experi- 
enced as unpleasant even when contained in 
balanced units. This unpleasantness was par- 
ticularly acute when P was a part of the nega- 
tive relation. Jordan's study permits a detailed 
analysis of these additional influences, which 
we shall not consider here. 

Newcomb (17), in his recent theory of inter- 
personal communication, has employed con- 
cepts rather similar to those of Heider. He 
conceives of the simplest communicative act as 
one in which one person A gives information 
to another person B about something X. The 
similarity of this A-B-X model to Heider's 
P-O-X unit, together with its applicability to 
objective interpersonal relations (rather than 
only to the cognitive structure of a single 
person) may be seen in the following quota- 
tions from Newcomb: 


A-B-X is . . . regarded as constituting a system. 
That is, certain definable relationships between 
A and B, between A and X, and between B 
and X are all viewed as interdependent. . . . 
For some purposes the system may be regarded 
as a phenomenal one within the life space of 
A or B, for other purposes as an “objective” 
system including all the possible relationships 
as inferred from observations of A’s and B's 
behavior (17, p. 393). 


Newcomb then develops the concept of 
"strain toward symmetry," which appears to 
be a special instance of Heider's more general 
notion of "tendency toward balance." "Strain 
toward symmetry" is reflected in several mani- 
festations of a tendency for A and B to have 
attitudes of the same sign toward a common 
X. Communication is the most common and 
usually the most effective manifestation of this 
tendency. 

By use of this conception Newcomb reinter- 
prets several studies (1, 4, 5, 16, 20) which have 
investigated the interrelations among inter- 
personal attraction, tendencies to communicate, 
pressures to uniformity of opinion among mem- 
bers of a group, and tendencies to reject devi- 


ates. The essential hypothesis in this analysis 
is stated thus: 


If A is free either to continue or not to con- 
tinue his association with B, one or the other 
of two eventual outcomes is likely: (a) he 
achieves an equilibrium characterized by rela- 
tively great attraction toward B and by rela- 
tively high perceived symmetry, and the 
association is continued; or (b) he achieves an 
equilibrium characterized by relatively little 
attraction toward B and by relatively low per- 
ceived symmetry, and the association is dis- 
continued (17, p. 402). 


Newcomb's outcome a is clearly a balanced 
state as defined by Heider. Outcome b cannot 
be unambiguously translated into Heider's 
terms. If by “relatively little attraction toward 
B" is meant a negative L relation between 4 
and B, then this outcome would also seem to 
be balanced. Newcomb's "continuation or dis- 
continuation of the association between 4 and 
B" appear to correspond to Heider's U and 
~U relations. 


STATEMENT OF THE PROBLEM 


This work indicates that the tendency toward 
balance is a significant determinant of cognitive 
organization, and that it may also be important 
in interpersonal relations. The concept of bal- 
ance, however, has been defined so as to apply 
to a rather limited range of situations, and it 
has contained certain ambiguities. We note 
five specific problems. 


1, Unsymmetric relations. Should all rela- 
tions be conceived as symmetric? The answer 
is clearly that they should not; it is possible 
for P to like O while O dislikes P. In fact, 
Tagiuri, Blake, and Bruner (21) have inten- 
sively studied dyadic relations to discover con- 
ditions producing symmetric relations of actual 
and perceived liking. Theoretical discussions of 
balance have sometimes recognized this possi- 
bility—Heider, for example, states that unsym- 
metric liking is unbalanced—but there has 
been no general definition of balance which 
covers unsymmetric relations. The empirical 
studies of balance have assumed that the rela- 
tions are symmetric. 

2. Units containing more than three entities. 
Nearly all theorizing about balance has referred 
to units of three entities. While Horowitz, 
Lyons, and Perlmutter studied units with four 
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entities, they did not define balance for such 
cases. It would seem desirable to be able to 
speak of the balance of even larger units. 

3. Negative relations. Is the negative relation 
the complement of the relation or its opposite? 
All of the discussions of balance seem to equate 
these, but they seem to us to be quite different, 
for the complement of a relation is expressed 
by adding the word "not" while the opposite 
is indicated by the prefix "dis" or its equivalent. 
Thus, the complement of "liking" is "not lik- 
ing"; the opposite of “liking” is “disliking.” In 
general, it appears that —L has been taken to 
mean "dislike" (the opposite relation) while 
~U has been used to indicate “not associated 
with" (the complementary relation). Thus, for 
example, Jordan says: "Specifically, '+L' sym- 
bolizes a positive attitude, ‘—L’ symbolizes a 
negative attitude, ‘+U’ symbolizes the existence 
of unit formation, and '—U' symbolizes the lack 
of unit formation" (11, p. 274). 

4. Relations of different types. Heider has 
made a distinction between two types of rela- 
tions—one based upon liking and one upon 
unit formation. The various papers following 
up Heider's work have continued to use this 
distinction. And it seems reasonable to assume 
that still other types of relations might be des- 
ignated. How can a definition of balance take 
into account relations of different types? Heider 
has suggested some of the ways in which liking 
and unit relations may be combined, but a gen- 
eral formulation has yet to be developed. 

5. Cognitive fields and social systems. Hei- 
der's intention is to describe balance of cogni- 
tive units in which the entities and relations 
enter as experienced by a single individual. 
Newcomb attempts to treat social systems which 
may be described objectively. In principle, it 
should be possible also to study the balance of 
sociometric structures, communication networks, 
patterns of power, and other aspects of social 
systems. 


We shall attempt to define balance so as to 
overcome these limitations. Specifically, the 
definition should (a) encompass unsymmetric 
relations, (b) hold for units consisting of any 
finite number of entities, (c) preserve the dis- 
tinction between the complement and the op- 
posite of a relation, (d) apply to relations of 
different types, and (e) serve to characterize 
cognitive units, social systems, or any configu- 
ration where both a relation and its opposite 
must be specified. 


THE CONCEPTS OF GRAPH, DIGRAPH, AND 
SIGNED GRAPH 


Our approach to this problem has two pri- 
mary antecedents: (a) Lewin's treatment (15) of 
the concepts of whole, differentiation, and unity, 
together with Bavelas’ extension (2) of this 
work to group structure; and (b) the mathe- 
matical theory of linear graphs. 

Many of the graph-theoretic definitions given 
in this section are contained in the classical 
reference on graph theory, Kénig (14), as well 
as in Harary and Norman (8). We shall discuss, 
however, those concepts which lead up to the 
theory of balance. 

A linear graph, or briefly a graph, consists of 
a finite collection of points’ A, B, C, . . . to- 
gether with a prescribed subset of the set of all 
unordered pairs of distinct points. Each of these 
unordered pairs, AB, is a line of the graph. 
(From the viewpoint of the theory of binary 
relations a graph corresponds to an irreflex- 
ive symmetric relation on points 4, B, C, . . . . 
Alternatively a graph may be represented as a 
matrix.) 

Figure ! depicts a graph of four points and 
four lines. The points might represent people, 


A B 


D c 
Figure 1. A linear graph of four points and four 
lines. The presence of line AB indicates the exist- 
ence of a specified symmetric relationship between 
the two entities A and B. 


and the lines some relationship such as mutual 
liking. With this interpretation, Figure 1 indi- 
cates that mutual liking exists between those 
pairs of people 4, B, C, and D joined by lines. 
Thus D is in the relation with all other persons, 
while C is in the relation only with D. Figure 1 
could be used, of course, to represent many 


3 Points are often called “vertices” by mathemati- 
cians and “nodes” by electrical engineers. 

4 This is the approach used by Heider. 

5 A relation is irreflexive if it contains no ordered 
pairs of the form (a, a), ie., if no element is in 
this relation to itself. 

6 This treatment is discussed in Festinger (3). The 
logical equivalence of relations, graphs, and ma- 
trices is taken up in Harary and Norman (8). 
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other kinds of relationships between many 
other kinds of entities. 

It is apparent from this definition of graph 
that relations are treated in an all-or-none man- 
ner, i.e., either a relation exists between a given 
pair of points or it does not. Obviously, how- 
ever, many relationships of interest to psychol- 
ogists (liking, for example) exist in varying 
degrees. This fact means that our present use 
of graph theory can treat only the structural, 
and not the numerical, aspects of relations. 
While our treatment is thereby an incomplete 
representation of the strength of relations, we 
believe that conceptualization of the structural 
properties of relations is a necessary first step 
toward a more adequate treatment of the more 
complex situations. Such an elaboration, how- 
ever, goes beyond the scope of this paper. 

A directed graph, or a digraph, consists of 
a finite collection of points together with a pre- 
scribed subset of the set of all ordered pairs of 
distinct points. Each of these ordered pairs AB 
is called a line of the digraph. Note that the 
only difference between the definitions of graph 
and digraph is that the lines of a graph are 
unordered pairs of points while the lines of a 
digraph are ordered pairs of points. An ordered 
fair of points is distinguished from an unor- 
dered pair by designating one of the points as 
the first point and the other as the second. 
Thus, for example, the fact that a message can 
go from A to B is represented by the ordered 
pair (4, B), or equivalently, by the line AB, as 
in Figure 2. Similarly, the fact that A and D 
choose each other is represented by the two 
directed lines AD and DA. 

A signed graph, or briefly an s-graph, is ob- 
tained from a graph when one regards some of 
the lines as positive and the remaining lines 
as negative. Considered as a geometric represen- 
tation of binary relations, an s-graph serves to 


A B 


D [^] 


Figure 2. A directed graph of four points and five 
directed lines. An AB line indicates the existence 
of a specified ordered relationship involving the two 
entities A and B. Thus, for example, if A and B 
are two people, the line might indicate that a 
message can go from A to B or that A chooses B. 


depict situations or structures in which both a 
relation and its opposite may occur, e.g., like 
and dislike. Figure 3 depicts an sgraph, em- 
ploying the convention that solid lines are posi- 
tive and dashed lines negative; thus A and B 
are represented as liking each other while 4 
and C dislike each other. 


B [^] D 


A 


Figure 3. A signed graph of four points and five 
lines. Solid lines have a positive sign and dashed 
lines a negative sign. If the points stand for people 
and the lines indicate the existence of a liking rela- 
tionship, this s-graph shows that A and B have a 
relationship of liking, A and C have one of dislik- 
ing, and B and D have a relationship of indiffer- 
ence (neither liking nor disliking). 


Combining the concepts of digraph and s- 
graph, we obtain that of an s-digraph. A signed 
digraph, or an s-digraph, is obtained from a 
digraph by taking some of its lines as positive 
and the rest as negative. 

A graph of type 2 (8), introduced to depict 
structures in which two different relations de- 
fined on the same set of elements occur, is 
obtained from a graph by regarding its lines as 
being of two different colors (say), and by per- 
mitting the same pair of points to be joined 
by two lines if these lines have different colors. 
A graph of type x, v-1,2,8, . . . , is defined 
similarly. In an s-graph or s-digraph of type 2, 
there may occur lines of two different types in 
which a line of either color may be positive or 
negative. An example of an sgraph of type 2 
might be one depicting for the same P-O-X 
unit both U and L relations among the entities, 
where the sign of these relations is indicated. 

A path is a collection of lines of a graph of 
the form AB, BC, ..., DE, where the points 
A, B, C, . . . , D, E, are distinct. A cycle con- 
sists of the above path together with the line 
EA. The length of a cycle (or path) is the num- 
ber of lines in it; an n-cycle is a cycle of length 
n. Analogously to graphs, a path of a digraph 
consists of directed lines of the form AB, BC, 
..., DE, where the points are distinct. A cycle 
consists of this path together with the line EA. 
In the later discussion of balance of an s- 
digraph we shall use the concept of a semicycle. 
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We illustrate semicycles with the digraph of 


cr HB Be E ca 2b. 


di 

Wi. The last two of these semicycles are not 
cycles. Note that every cycle is a semicycle, and 
a semicycle of length 2 is necessarily a cycle. 


BALANCE 


With these concepts of digraphs, and signed 
graphs we may now develop a rigorous general- 
ization of Heider's concept of balance. 

It should be evident that Heider's terms, en- 
tity, relation, and sign of a relation may be 
coordinated to the graphic terms, point, di- 
rected line, and sign of a directed line. ‘Thus, 
for example, the assertion that P likes O (PLO) 
may be depicted as a directed line of positive 
sign PO. It should also be clear that Heider's 
two different kinds of relations (L and U) may 
be treated as lines of different type. It follows 
that a graphic representation of a P-O-X unit 
having positive or negative L and U relations 
will be an s-digraph of type 2. 

For simplicity of discussion we first consider 
the situation containing only symmetric rela- 
tions of a single type (i.e. an sgraph of type 1). 
Figure 4 shows four such sgraphs. It will be 
noted that each of these sgraphs contains one 
cycle: AB, BC, CA. We now need to define the 
sign of a cycle. The sign of a cycle is the prod- 
uct of the signs of its lines. For convenience 
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Figure 4. Four s-graphs of three points and three 
lines each. Structures (a) and (b) are balanced, but 
(c)and (d) are not balanced. 


we denote the sign of a line by +1 or —1 when 
it is positive or negative. With this definition 
we we that the cycle, AB, BC, CA is positive 
in sgraph (a) («ls 41-41), positive in + 
graph (b) (+1 + —1* —1), negative in &graph 
(0) (V * +I- — 1), and negative in sgraph (d) 
(71: —1: —1). To generalize, a cycle is posi- 
tive if it contains an even number of negative 
lines, and it is negative otherwise, Thus, in 
particular, a cycle containing only positive lines 
is positive, since the number of negative lines 
is zero, an even number. 

In discusing the concept of balance, Heider 
states (see 9, p. 110) that when there are three 
entities a balanced state exists if all three rela- 
tions are positive or if two are negative and 
one positive. According to this definition, s 
graphs (a) and (b) are balanced while sgraphs 
(c) and (d) are not (Figure 4). We note that in 
the examples cited Heider's balanced state is 
depicted as an sgraph of three points whose 
cycle is positive. 

In generalizing Heider's concept of balance, 
we propose to employ this characteristic of bal- 
anced states as a general criterion for balance 
of structures with any number of entities. Thus 
we define an sgraph (containing any number 
of points) as balanced if all of its cycles are 

Figure 5 illustrates this definition for four s- 
graphs containing four points. In each of these 
sgraphs there are seven cycles: 4B, BC, CA; 
AB, BD, DA; BC, CD, DB; AC, CD, DA; AB, 
BC, CD, DA; AB, BD, DC, CA; and BC, CA, 
AD, DB. It will be seen that in sgraphs (a) and 
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Figure 5. Four s-graphs containing four points and 
six lines each. Structures (a) and (b) are balanced, 
but (c) and (d) are not balanced. 
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(b) all seven cycles are positive, and these s- 
graphs are therefore balanced. In s-graphs (c) 
and (d) the cycle, AB, BC, CA, is negative (as 
are several others), and these s-graphs are there- 
fore not balanced. It is obvious that this defi- 
nition of balance is applicable to structures 
containing any number of entities.7 

The extension of this definition of balance 
to s-digraphs containing any number of points 
is straightforward. Employing the same defini- 
tion of sign of a semicycle for an s-digraph as 
for an ordinary s-graph, we similarly define an 
s-digraph as balanced if all of its semicycles are 
positive. 

Consider now Heider's P-O-X unit, contain- 
ing two persons P and O and an impersonal 
entity X, in which we are concerned only with 
liking relations. Figure 6 shows three of the 


Figure 6. Three sdigraphs representing Heider's 
P-O-X units. Only structure (b) is balanced. 


possible 3-point s-digraphs which may represent 
such P-O-X units. A positive line means 
that P likes O, a negative PO line means that 
P dislikes O, We assume that a person can like 
or dislike an impersonal entity but that an im- 
personal entity can neither like nor dislike a 
person.* We also rule out of consideration here 


7 If an s-graph contains no cycles, we say that it is 
"vacuously" balanced, since all (in this case, none) 
of its cycles are positive. 

* In terms of digraph theory we define an object 
as a point with zero output, Thus a completely in- 
different person is an object. If, psychologically, an 
impersonal entity is active and likes or dislikes a 
Person or another impersonal entity, then in terms 
of digraph theory it is not an object. 


“ambivalence,” where a person may simulta- 
neously like and dislike another person or im- 
personal entity. 

In each of these_s-digraphs there are three 
semicycles: PO, OP; PO, Ox, XP; and PO, 
OX P. If we confine our discussion to the 
kind of structures represented in Figure 6 (i.e, 
where there is no ambivalence and where all 
possible positive or negative lines are present), 
it will be apparent that: when P and O like 
each other, the s-digraph is balanced only if 
both persons either like or dislike X (s-digraph 
(a) is not balanced); when P and O dislike each 
other, the s-digraph is balanced only if one per- 
son likes X and the other person dislikes X (s- 
digraph (b) is balanced); and when one person 
likes the other but the other dislikes him, the 
sdigraph must be not balanced [s-digraph (c) 
is not balanced]. These conclusions are consis- 
tent with Heider's discussion of P-O-X units 
and with Newcomb's treatment of the A-B-X 
model. 

The further extension of the notion of bal- 
ance to sgraphs of type 2 remains to be made. 
The simplest procedure would be simply to 
ignore the types of lines involved. Then we 
would again define an sgraph of type 2 to be 
balanced if all of its cycles are positive. This 
definition appears to be consistent with Hei- 
der's intention, at least as it applies to a situa- 
tion containing only two entities. For in speak- 
ing of such situations having both L and U 
relations, he calls them balanced if both rela- 
tions between the same pair of entities are of 
the same sign (see 9, p. 110). There remains 
some question as to whether this definition will 
fit empirical findings for cycles of greater length. 
Until further evidence is available, we advance 
the above formulation as a tentative definition. 
Obviously the definition of balance can be given 
for «graphs of general type x in the same way. 


SOME THEOREMS ON BALANCE 


By definition, an sgraph is balanced if and 
only if each of its cycles is positive. In a given 
situation represented by an sgraph, however, it 
may be impractical to single out each cycle, 
determine its sign, and then declare that it is 
balanced only after the positivity of every cycle 
has been checked. Thus the problem arises of 
deriving a criterion for determining whether or 
not a given graph is balanced without having 
to revert to the definition. This problem is the 
subject matter of a separate paper (6), in which 
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two necessary and sufficient conditions for an 
s-graph to be balanced are developed. The first 
of these is no more useful than the definition 
in determining by inspection whether an s-graph 
is balanced, but it does give further insight into 
the notion of balance. Since the proofs of these 
theorems may be found in the other paper, we 
shall not repeat them here. 

Theorem. An s-graph is balanced if and only 
if all paths joining the same pair of points have 
the same sign. 

Thus, we can ascertain that the s-graph of 
Figure 7 is balanced either by listing each cycle 


w 


Figure 7. An s-graph of eight points and thirteen 
lines which, by aid of the structure theorem, can 
be readily seen as balanced. 


separately and verifying that it is positive, or, 
using this theorem, by considering each pair of 
possible points and verifying that all possible 
paths joining them have the same sign. For 
example, all the paths between points A and E 
are negative, all paths joining 4 and C are 
positive, etc. 

The following structure theorem has the ad- 
vantage that it is useful in determining whether 
or not a given s graph is balanced without an 
exhaustive check of the sign of every cycle, or 
of the signs of all paths joining every pair of 
points. 

Structure theorem. An s-graph is balanced if 
and only if its points can be separated into two 
mutually exclusive subsets such that each posi- 
tive line joins two points of the same subset 
and each negative line joins points from differ- 
ent subsets. 

Using the structure theorem, one can see at 
a glance that the sgraph of Figure 7 is bal- 
anced, for 4, B, C, D, and E, F, G, H are 
clearly two disjoint subsets of the set of all 
points which satisfy the conditions of the struc- 
ture theorem. 

It is not always quite so easy to determine 
balance of an sgraph by inspection, for it is 


not always necessarily true that the points of 
each of the two subsets are connected to each 
other. Thus the two s-graphs of Figure 8 are 
balanced, even though neither of the two dis- 
joint subsets is a connected subgraph. However, 
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Figure 8. Two s-graphs whose balance cannot be 
determined easily by visual inspection. 


the structure theorem still applies to both of 
the s-graphs of Figure 8. In the first graph the 
appropriate subsets of points are 4, D, E, H 
and B, C, F, G; while in the second one we 
take A,, By, Ag, By, As, Bg and Ag, By, 44, B. 

In addition to providing two necessary and 
sufficient conditions for balance, these theorems 
give us further information about the nature 
of balance. Thus if we regard the sgraph as 
representing Heider's L-relation in a group, 
then the structure theorem tells us that the 
group is necessarily decomposed into two sub- 
groups (cliques) within which the relationships 
that occur are positive and between which they 
are negative. The structure theorem, however, 
does not preclude the possibility that one of 
the two subsets may be empty—as, for example, 
when a connected graph contains only positive 
lines. 

The first theorem also leads to some inter- 
esting consequences. Suppose it were true, for 
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example, that when two people like each 
other they can influence each other positively 
(ie. produce intended changes in the other), 
but when two people dislike each other they 
can only influence each other negatively (i.e., 
produce changes opposite to those intended). 
An sgraph depicting the liking relations among 
a group of people will, then, also depict the 
potential influence structure of the group. Sup- 
pose that Figure 7 represents such a group. If 
A attempts to get H to approve of something, 
H will react by disapproving. If H attempts, in 
turn, to get G to disapprove of the same thing, 
he will succeed. Thus 4’s (indirect) influence 
upon G is negative. The first theorem tells us 
that A's influence upon G must be negative, 
regardless of the path along which the influence 
passes, since the s-graph is balanced. In general, 
the sign of the influence exerted by any point 
upon any other will be the same, no matter 
what path is followed, since the graph is bal- 
anced. 

By use of the structure theorem it can be 
shown that in a balanced group any influence 
from one point to another within the same 
clique must be positive, even if it passes through 
individuals outside of the clique, and the influ- 
ence must be negative if it goes from a person 
in one clique to a person in the other. (It 
should be noted that in this discussion we give 
the term "clique" a special meaning, as above.) 
Thus, under the assumed conditions, any ex- 
erted influence regarding opinions will tend to 
produce homogeneity within cliques and oppos- 
ing opinions between cliques. 

Although we have illustrated these theorems 
by reference to social groups, it should be ob- 
vious that they hold for any empirical realiza- 
tions of s-graphs, 


FURTHER CONCEPTS IN THE THEORY OF 
BALANCE 


The concepts of balance as developed up to 
this point are clearly oversimplifications of the 
full complexity of situations with which we 
want to deal. To handle such complex situa- 
tions more adequately, we need some further 
concepts. 

Thus far we have only considered whether a 
given sgraph is balanced or not balanced. But 
it is intuitively clear that some unbalanced s 
graphs are “more balanced” than others! This 
suggests the introduction of some scale of bal- 
ance, along which the “amount” of balance 


possessed by an unbalanced s-graph may be mea- 
sured. Accordingly we define the degree of bal- 
ance of an s-graph as the ratio of the number 
of positive cycles to the total number of cycles. 
In symbols, let G be an s-graph, 


c(G) = the number of cycles of G, 
c,(G) = the number of positive cycles of G, 
and 
b(G) = the degree of balance of G. 


Then 
c,(G) 


b(G) = «(G) 


Since the number c , (G) can range from zero 
to c(G) inclusive, it is clear that b(G) lies be- 
tween 0 and 1. Obviously b(G) — 1 if and only 
if G is balanced. We can give the number b(G) 
the following probabilistic interpretation: the 
degree of balance of an s-graph is the probabil- 
ity that a randomly chosen cycle is positive. 

Does b(G) = 50% mean that G is exactly one- 
half balanced? The answer to this question de- 
pends on the possible values which b(G) may 
assume. This in turn depends on the structure 
of the sgraph G. Thus, if G is the complete 
graph of 3 points and G is not balanced, then 
the only possible value is b(G) — 0, since there 
is only one cycle. Similarly, if the lines of G 
are as in Figure 9, some of which may be nega- 


A B 


D c 


Figure 9. A graph of four points which can ac- 
quire degrees of balance of only 33 and 1.00 re- 


gardless of the assignment of positive and negative 
signs to its five lines. 


tive, and if G is not balanced, then the only 
possible value is b(G)=4, and b(G) = 50% 
does not even occur for this structure. Thus 
any interpretation given to the numerical value 
of b(G) must take into account the distribution 
curve for b(G), which is determined by the 
structure of G. 

We now consider the corresponding concept 
of the degree of balance for sdigraphs. Since 
an sdigraph is balanced if all of its semicycles 
are positive, the degree of balance of an + 
digraph is taken as the ratio of the number of 


STRUCTURAL BALANCE: A GENERALIZATION OF HEIDER’S THEORY 321 


positive semicycles to the total number of semi- 
cycles. 

In a given s-graph which represents the signed 
structure of some psychological situation, it may 
happen that only cycles of length 3 and 4 are 
important for the purpose of determining bal- 
ance. Thus in an s-graph representing the rela- 
tion L in a complex group, it will not matter 
at all to the group as a whole whether a cycle 
of length 100, say, is positive. To handle this 
situation rigorously, we define an s-graph to be 
N-balanced if all its cycles of length not exceed- 
ing N are positive. Of course the degree of N- 
balance is definable and computable for any 
sgraph. Examples can be given of unbalanced 
sgraphs which are, however, N-balanced for 
some N. Figure 10 illustrates this phenomenon 


E D 


Figure 10. An s-graph which is 3-balanced but not 
4-balanced, 


for N = 8, since all of its 3-cycles are positive, 
but it has a negative 4-cycle.9 

For certain problems, one may wish to con- 
centrate only on one distinguished point and 
determine whether an s-graph is balanced there. 
This can be accomplished by the notion of 
local balance. We say an sgraph is locally bal- 
anced at point P if all cycles through P are 
positive. Thus the sgraph of Figure 11 is bal- 
anced at points 4, B, C, and not balanced at 
D, E, F. If this figure represents a sociometric 
structure, then the concept of local balance at 
A is applicable provided 4 is completely un- 
concerned about the relations among D, E, F. 

Some combinatorial problems suggested by 
the notions of local balance and N-balance have 
been investigated by Harary (7). The principal 
theorem on local balance, which follows, uses 
the term "articulation point" which we now 
define. An articulation point! of a connected 


9 One way of viewing the definition of N-balance is 
to regard cycles of length N as having weight 1, 
and all longer cycles as of. weight 0. Of course, it 
is possible to generalize this idea by assigning 
weights to each length, eg. weight 1/2 to length 
N. 
10 A characterization of the articulation points of 


graph is one whose removal!! results in a dis- 
connected graph. Thus the point D is the only 
articulation point of Figure 11. We now state 
the main theorem on local balance, without 
proof. 


Figure 11. An s-graph which is locally balanced at 
points A, B, C, but not balanced at points D, E, F. 


Theorem. If a connected sgraph G is bal- 
anced at P, and Q is a point on a cycle passing 
through P, where Q is not an articulation point, 
then G is also balanced at Q. 

Figure 11 serves to illustrate this theorem, for 
the sgraph is balanced at 4, and is also bal- 
anced at B but is not balanced at D, which is 
an articulation point. 

In actual practice, both local balance and N- 
balance may be employed. This can be handled 
by introducing the combined concept of local 
N-balance. Formally we say that an sgraph is 
locally N-balanced at P if all cycles of length 
not exceeding N and passing through P are 
positive. Obviously the degree of local N-bal- 
ance can be defined analogously to the degree 
of balance. 

In summary, the concept of degree of balance 
removes the limitation of dealing with only bal- 
anced or unbalanced structures, and in addition 
is susceptible to probabilistic and statistical 
treatment. The definition of local balance en- 
ables one to focus at any particular point of the 
structure, The introduction of N-balance frees 
us from the necessity of treating all cycles as 
equally important in determining structural 
balance. Thus, the extensions of the notion of 
balance developed in this section permit a study 
of more complicated situations than does the 
original definition of Heider. 


a graph, or in other words the liaison persons in a 
group, is given by Ross and Harary (19), using the 
"structure matrix" of the graph. An exposition of 
this concept is given in Harary and Norman (8). 

11 By the removal of a point of a graph is meant 
the deletion of the point and all lines to which it 
is incident. 
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ADEQUACY OF THE GENERAL THEORY OF 
BALANCE 


In any empirical science the evaluation of a 
formal model must be concerned with both its 
formal properties and its applicability to em- 
pirical data. An adequate model should account 
for known findings in a rigorous fashion and 
lead to new research. Although it is not our 
purpose in this article to present new data con- 
cerning tendencies toward balance in empirical 
systems, we may attempt to evaluate the ade- 
quacy of the proposed general theory of bal- 
ance in the light of presently available research. 

Our review of Heider's theory of balance 
and of the research findings related to it has 
revealed certain ambiguities and limitations 
concerning (a) the treatment of unsymmetric 
relations; (b) the generalization to systems con- 
taining more than three entities; (c) the distinc- 
tion between the complement and the opposite 
of a relation; (d) the simultaneous existence of 
relationships of different types; and (e) the ap- 
plicability of the concept of balance to empir- 
ical systems other than cognitive ones. We now 
comment briefly upon the way in which our 
generalization deals with each of these problems. 

Unsymmetric relations. It was noted above 
that, while theoretical discussions of balance 
have sometimes allowed for the possibility of 
unsymmetric relations, no rigorous definition of 
balance has been developed to encompass situ- 
ations containing unsymmetric relationships. 
Furthermore, empirical studies have tended to 
assume that liking is reciprocated, that each 
liking relation is symmetric. By stating the defi- 
nition of balance in terms of s-digraphs, we are 
able to include in one conceptual scheme both 
symmetric and unsymmetric relationships. And 
it is interesting to observe that, according to 
this definition, whenever the lines PO and OP 
are of different signs, the s-digraph containing 
them is not balanced. Thus, to the extent that 
tendencies toward balance have been effective 
in the settings empirically studied, the assump- 
tion of symmetry has, in fact, been justified. 

Situations containing any finite number of 
entities. Heider's discussion of balance has been 
confined to structures containing no more than 
three entities. The definition of balance ad- 
vanced here contains no such limitation; it is 
applicable to structures containing any finite 
number of entities. Whether or not empirical 
theories of balance will be confirmed by re- 
search dealing with larger structures can only 


be determined by empirical work. It is clear, 
however, that our generalization is consistent 
with the more limited definition of Heider. 

A relation, its complement, and its opposite. 
Using s-graphs and sdigraphs to depict rela- 
tionships between entities allows us to distin- 
guish among three situations: the presence of 
a relation (positive line), the presence of the 
opposite of a relation (negative line), and the 
absence of both (no line). The empirical utili- 
zation of this theory requires the ability to dis- 
tinguish among these three situations. In our 
earlier discussion of the literature on balance, 
we noted, however, a tendency to distinguish 
only the presence or absence of a relationship. 
It is not always clear, therefore, in attempting 
to depict previous research in terms of s-graph 
theory whether a given empirical relationship 
should be coordinated to no line or to a nega- 
tive line. 

The experiment of Jordan (11) illustrates 
this problem quite clearly. He employed three 
entities and specified certain U and L relations 
between each pair of entities. The empirical re- 
alization of these relations was obtained in the 
following way: U was made into “has some sort 
of bond or relationship with"; ~U into “has 
no sort of bond or relationship with"; L was 
made into “like”; and ~L into “dislike.” 
Viewed in the light of «graph theory, it would 
appear that Jordan created sgraphs of type 2 
(which may contain positive and negative lines 
of type U and type L). It would also appear, 
however, that the ~U relation should be de- 
picted as the absence of any U-line but that the 
~L relation should be depicted as a negative 
L-line. If this interpretation is correct, Jordan's 
classification of his situations as "balanced" and 
"imbalanced" will have to be revised. Instead 
of interpreting the ~U relation as a negative 
line, we shall have to view it as no U-line, with 
the result that all of his situations containing 
~U relations are vacuously balanced by our 
definition since there are no cycles. 

It is interesting to examine Jordan's data in 
the light of this reinterpretation. He presented 
subjects with 64 hypothetical situations, half of 
which were "balanced" and half "imbalanced" 
by his definition. He had subjects rate the de- 
gree of pleasantness or unpleasantness experi- 
enced in each situation (a high score indicating 
unpleasantness). For "balanced" situations the 
mean rating was 46 and for "imbalanced" ones, 
57. 

If, however, we interpret Jordan's ~U rela- 
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tion as the absence of a line, his situations must 
be reclassified. Of his 32 "balanced" situations, 
14 have no ~U relation and thus remain bal- 
anced. The mean unpleasantness score for these 
is 39. The remaining 18 of his "balanced" situ- 
ations, having at least one —U relation, become 
vacuously balanced since no cycle remains. The 
mean unpleasantness of these vacuously bal- 
anced situations is 51. Of Jordan's 32 “imbal- 
anced” situations, 19 contain at least one ~U 
relation, thus also becoming vacuously balanced, 
and the mean unpleasantness score for these is 
51. The remaining 13 situations, by having no 
~U relations, remain imbalanced, and their 
mean score is 66. Thus it is clear that the differ- 
ence in pleasantness between situations classed 
by Jordan as "balanced" and "imbalanced" is 
greatly increased if the vacuously balanced situ- 
ations are removed from both classes (balanced, 
39; vacuously balanced, 51; not balanced, 66). 
These findings lend support to our view that 
the statement "has no sort of bond or relation- 
ship with" should be represented as the absence 
of a line.12 

Relations of different types. A basic feature 
of Heider's theory of balance is the designa- 
tion of two types of relations (L and U). Our 
generalization of the definition of balance per- 
mits the inclusion of any number of types of 
relations. Heider discusses the combination of 
types of relations only for the situation involv- 
ing two entities, and it is clear that our defi- 
nition is consistent with his within this 
limitation. It is interesting to note that Jordan 
(11) finds positive liking relations to be experi- 
enced as more pleasant than negative ones. 
This finding may be interpreted as indicating 
a tendency toward “positivity” over and above 
the tendency toward balance. It is possible, 
however, that in the hypothetical situations 
employed by Jordan the subjects assumed posi- 
tive unit relations between each pair of entities. 
If this were in fact true, then a positive liking 
relation would form a positive cycle of length 
2 with the positive unit relation, and a nega- 
tive liking relation would form a negative cycle 


12 A strict test of our interpretation of Jordan's 
data is not possible since he specified for any given 
pair of entities only either the L or U relation. 
We can but guess how the subjects filled in the 
missing relationship. In the light of our discussion 
of relations of different types, in the next section, 
it appears that subjects probably assumed a positive 
unit relation when none was specified, since there 
is a marked tendency to experience negative liking 
relations as unpleasant. 


of length 2 with the positive unit relation. 
And, according to the theory of balance, the 
positive cycle should produce more pleasant- 
ness than the negative one. This interpretation 
can be tested only through further research in 
which the two relations are independently 
varied. 

Empirical applicability of concept of balance. 
Heider's discussion of balance refers to a cog- 
nitive structure, or the life space of a single 
person. Newcomb suggests that a similar con- 
ception may be applicable to interpersonal 
systems objectively described. Clearly, our defi- 
nition of balance may be employed whenever 
the terms "point" and "signed line" can be 
meaningfully coordinated to empirical data of 
any sort. Thus, one should be able to charac- 
terize a communication network or a power 
structure as balanced or not. Perhaps it would 
be feasible to use the same definition in describ- 
ing neural networks. It must be noted, however, 
that it is a matter for empirical determination 
whether or not a tendency to achieve balance 
will actually be observed in any particular 
kind of situation, and what the empirical conse- 
quences of not balanced configurations are. 
Before extensive utilization of these notions 
can be accomplished, certain further concep- 
tual problems regarding balance must be 
solved. 

One of the principal unsolved problems is 
the development of a systematic treatment of 
relations of varying strength. We believe that 
it is possible to deal with the strength of rela- 
tions by the concept of a graph of strength o, 
suggested by Harary and Norman (8). 


SUMMARY 


In this article we have developed a generali- 
zation of Heider's theory of balance by use of 
concepts from the mathematical theory of lin- 
ear graphs. By defining balance in graph- 
theoretic terms, we have been able to remove 
some of the ambiguities found in previous dis- 
cussions of balance, and to make the concept 
applicable to a wider range of empirical situ- 
ations than was previously possible, By intro- 
ducing the concept degree of balance, we have 
made it possible to treat problems of balance 
in statistical and. probabilistic terms. It should 
be easier, therefore, to make empirical tests of 
hypotheses concerning balance. 

Although Heider's theory was originally in- 
tended to refer only to cognitive structures of 
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an individual person, we propose that the defi- ture research can determine whether theories 
nition of balance may be used generally in of balance can be established for all of these 
describing configurations of many different configurations. The definitions developed here 
sorts, such as communication networks, power do, in any case, give a rigorous method for 
systems, sociometric structures, systems of orien- describing certain structural aspects of empiri- 
tations, or perhaps neural networks. Only fu- cal configurations. 
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35. Cognitive Structure and Attitudinal Affect 


MILTON J. ROSENBERG 


Understanding of the related processes of atti- 
tude learning and attitude change will prob- 
ably be advanced by the investigation of 
structural relationships between attitudes and 
beliefs about the objects of attitudes. The pres- 
ent research is an attempt to verify a set of 
hypotheses about such relationships. These hy- 
potheses have much in common with some that 
have already been presented by Cartwright (2), 
Hilliard (3) Smith (8), Tolman (9), and 
Woodruff (11, 12). However, they differ from 
these earlier formulations in attempting to 
delineate more explicitly certain variables that 
are assumed to covary with attitude (here de- 
fined as relatively stable affective response to 
an object), One of these variables is the inten- 
sity of a person’s values. A second is the per- 
ceived importance of the attitude object in 
leading to or blocking the attainment of his 
values. 

The general theoretical view underlying the 
present study includes the following points: 
(a) When a person has a relatively stable ten- 
dency to respond to a given object with either 
positive or negative affect, such a tendency is 
accompanied by a cognitive structure made up 
of beliefs about the potentialities of that object 
for attaining or blocking the realization of val- 
ued states; (b) the sign (positive or negative) 
and extremity of the affect felt toward the ob- 
ject are correlated with the content of its asso- 
ciated cognitive structure. Thus strong and 
stable positive affect toward a given object 
should be associated with beliefs to the effect 
that the attitude object tends to facilitate the 
attainment of a number of important values, 
while strong negative affect should be associ- 


* Reprinted from Journal of Abnormal and Social 
Psychology, 1956, 58, 367-372, with permission of 
the author and the American Psychological Associ- 
ation. The study was supported by the U. S. Air 
Force, Human Resources Research Institute, under 
contract # AF33-(038)-26646. 


ated with beliefs to the effect that the attitude 
object tends to block the attainment of im- 
portant values. Similarly, moderate positive or 
negative affects should be associated with be- 
liefs that relate the attitude object to less im- 
portant values or, if to important values, then 
with less confidence as to the existence of clear- 
cut instrumental relationships between the 
attitude object and the values in question. 

From this view, three specific hypotheses 
were formulated for experimental test: 


l. The degree and sign of affect aroused in 
an individual by an object (as reflected by the 
position he chooses on an attitude scale) vary 
as a function of the algebraic sum of the 
products obtained by multiplying the rated 
importance of each value associated with that 
object by the rated potency of the object for 
achieving or blocking the realization of that 
value. 

2. The degree and sign of affect aroused in 
an individual by an object (as reflected by the 
position he chooses on an attitude scale) vary 
as a function of the algebraic sum of his rat- 
ings of the potency of that object for achieving 
or blocking the realization of his values (when 
the importance of these values is held constant). 

3. The degree and sign of affect aroused in 
an individual by an object (as reflected by the 
position he chooses on an attitude scale) vary 
as a function of the algebraic sum of his rat- 
ings of the importance of the values whose 
attainment or blocking he perceives to be af- 
fected through the instrumental potency of 
that object (when the instrumental potency of 
that object for attaining or blocking each of 
these values is held constant). 


METHOD 


Subjects and Attitude Measure 


One hundred and twenty Ss, recruited from 
undergraduate courses at the University of 
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Michigan and the Ypsilanti State Teachers 
College, took an attitude questionnaire that 
contained, among other items, one dealing 
with the issue of “whether members of the 
Communist Party should be allowed to address 
the public." Each S checked his first choice 
among five alternative statements. Seventeen 
$s chose the alternative indicating extreme op- 
position to allowing members of the Commu- 
nist Party to address the public, 31 Ss the 
alternative indicating moderate opposition, 44 
Ss that indicating moderate approval, and 25 
Ss that indicating extreme approval. Only three 
Ss chose the alternative indicating “neutrality” 
on the issue, and, because of their small num- 
ber, were excluded from the research popula- 
tion. In a retest of 95 Ss on this measure after 
an interval of at least two months, a reliability 
coefficient of .72 was obtained (p < .001). 


Value Measures 


Three to five weeks after the administration 
of the attitude measure two card-sorting tasks 
were administered individually. These tasks 
required $ to categorize each of a group of 
value items in terms of (a) value importance, 
ie, its importance to him as a "source of satis- 
faction" and (b) perceived instrumentality, i.e., 
his estimate as to whether, and to what extent, 
the value in question would tend to be 
achieved or blocked through the "policy of 
allowing members of the Communist Party to 
address the public." 

The pack of value-cards included 35 items 
constructed in the light of White's value- 
analysis technique (10) and Murray's analysis 
of major needs (6). (The value items used are 
given in Table 1.) In addition to these 35 
items, the pack of cards presented to each S 
also contained value terms that had been coded 
out of his questionnaire answer to a verbal 
probe in conjunction with the item on "allow- 
ing members of the Communist Party to ad- 
dress the public." For most Ss the total number 
of such "salient" values came to two or three, 
the range among all Ss extending from zero 
to six. 

For the "value importance" measure S was 
asked to rank each card so as to indicate how 
much satisfaction he gets, or would get, from 
the value state that it described. Each value 
was to be judged independently in terms of 
21 categories ranging from "gives me maximum 
satisfaction” (Category +10) through “gives me 
neither satisfaction nor dissatisfaction” (Cate- 


TABLE 1. Value Items Used in Card-Sorting 
Tasks* 


iis 
= 


3. 
. Change and variety; having new kinds of 


26. 


27. 


People sticking to their own groups. 
People looking out for the welfare of 
others. 

Being looked up to by others. 


experience. 


. Sticking to a difficult task; solving difficult 


problems. 


. Making one’s own decisions. 
. People being strongly patriotic. 
. Serving the interests of the group to which 


one belongs. 


. Giving expression to feelings of anger or 


hostility. 


. Keeping promises made to others. 
. Having one's family approve of one's views. 
. The uncompromising administration of 


punishment to anyone who deserves it. 


. Having interesting work to do. 

. Having power and authority over people. 

. Being well-informed about current affairs. 
. Having the value of property well pro- 


tected. 


. People of different backgrounds getting to 


know each other better. 


. All human beings having equal rights. 


Being good-looking; having attractive face, 
body, or clothes. 


. Having a steady income. 
. Believing in a relationship between the 


individual and some higher spiritual power. 


. America having high prestige in other 


countries. 


. Being liked or loved by the opposite sex or 


associating with the opposite sex. 


. Getting advice on important problems. 
. People having the right to participate in 


making decisions which will affect them. 
Achieving superiority over others in such 
things as knowledge, work, or sports. 
Complying with the wishes of persons in 
authority. 


. Being like others in general; having the 


same interests. 


?9. People being well educated. 


. People having strict moral standards. 
. The open expression of disagreement be- 


tween people. 


. Everyone being assured of a good standard 


of living. 


. Being allowed to maintain the privacy of 


one's opinions and beliefs. 


. Being with other people; socializing. 
35. 


Letting others make their own decisions. 


* This list does not include the “salient” value items specially constructed for each S. 
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gory 0) to “gives me maximum dissatisfaction” 
(Category —10). For the measure of “perceived 
instrumentality" he was asked to judge and 
place each card in terms of 11 categories rang- 
ing from "the condition (value described on 
the card) is completely attained by allowing 
admitted Communists to address the public" 
(Category 4-5) through "whether or not admit- 
ted Communists are allowed to address the 
public is completely irrelevant to the attain- 
ment of the condition" (Category 0) to “the 
condition is completely blocked by allowing 
admitted Communists to address the public" 
(Category —5). In each of the two card-sorting 
tasks $ was required, after the initial categori- 
zation was completed, to rank the cards within 
each category in terms of "value importance" 
and "perceived instrumentality" respectively. 

The test-retest reliability of these two mea- 
sures was studied with a subpopulation of 12 Ss, 
the second administration following the first 
by from four to five weeks. For each of these 
Ss Spearman's rho was computed as an estimate 
of the degree of correlation between the ranks 
assigned to the separate values on the first and 
second administrations respectively. For the 
measure of value importance, the median rho 
was .89 (p <.01); for that of perceived instru- 
mentality, it was .74 (p < .01). 

From the data obtained through the two 
card-sorting tasks ten indices were computed 
to permit testing the major hypotheses. Opera- 
tional definitions of these indices are given in 
Tables 2 to 4. 


RESULTS AND DISCUSSION! 


Each hypothesis was tested by chi square,? 
computed from a 3x4 table. In each such 
table the four groups in terms of attitude posi- 
tion (extremely favorable, moderately favor- 
able, moderately unfavorable, and extremely 


1 All of the results reported in this section have 
been successfully replicated with regard to the atti- 
tude area of “allowing Negroes to move into white 
neighborhoods.” The replication was based on an 
analysis exactly parallel to the one described in the 
present report and upon identical measurement 
operations. 

2 This technique was used rather than correlational 
methods because it was felt that the assumptions 
required by correlational methods (particularly the 
assumptions of normality and equal-interval scales) 
were not fully met by the data, The method used 
for computing chi square was the “maximum like- 
lihood test" of Mood (5) which is applicable to data 
plots with low cell counts. 


unfavorable toward “allowing members of the 
Communist Party to address the public”) were 
cross classified in terms of a threefold categori- 
zation of the Ss on one of the indices derived 
from the card-sorting value measures. 


Hypothesis 1 

All of the four indices used to test Hypothesis 
1 provide, as the hypothesis requires, for rep- 
resentation of both the importance to the indi- 
vidual of some set of values and his perceptions 
as to how these values are affected (with regard 
to their attainment or blocking) through the 
instrumental agency of the attitude object 
(the policy of "allowing members of the Com- 
munist Party to address the public". By 
algebraically summing the importance-instru- 
mentality products for each of the values it is 
possible to represent appropriately the inter- 
action between positive and negative values 
on the one hand and perceptions of positive 
and negative instrumentality on the other. 
Thus a positive value (rated say as +7 in its 
capacity to satisfy the $) may be perceived as 
being attained through the policy of "allowing 
members of the Communist Party to address 
the public" (to the extent say, of a rating of 
+3 on "instrumentality"). In this case the prod- 
uct would be +21. If on the other hand, the 
same value were rated as being blocked by the 
policy of "allowing members of the Communist 
Party to address the public" (for example a 
rating of —3 on "instrumentality") the resultant 
product would be —21. Similarly, a value rated 
as yielding dissatisfaction and thus bearing a 
minus sign on "value importance" would yield 
a negative product if multiplied by a rating of 
positive instrumentality and a positive product 
if multiplied by a rating of negative instru- 
mentality. By summing all such products for 
a given set of values a single algebraic quantity 
is obtained representing the total import of the 
S's pattern of beliefs about the influence of the 
attitude object upon attaining or blocking vari- 
ous states that he values to differing degrees. 

Since all four of the indices used to test 
Hypothesis 1 were found to be significantly re- 
lated to attitude position (see Table 2), the 
hypothesis would appear to have been con- 
firmed at an acceptable level. These findings, 
to generalize, lend support to the view that 
beliefs associated with an attitudinal affect 
tend to be congruent with it; ie. that there 
exists within the individual an "organization" 
of the affective and cognitive properties of his 
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total pattern of response to what is, for him, 
an “attitude object.” 


TABLE 2. Relationships between “Value 
Importance X Perceived Instrumentality” 
(Indices 1 to 4) and Attitude Position 
(N —117) 


Description Chi Proba- 
of Index Square — bility* 


1 Algebraic sum of 26.33 001 
importance- 
instrumentality 
products for all 
value items 

2 Algebraic sum of 
importance- 
instrumentality 
products for the 
twenty values 
ranking 
highest on 
importance 

3 Algebraic sum of 
importance- 
instrumentality 
products for all 
"salient" values 

4 Algebraic sum of 
importance- 
instrumentality 
products for all 
“non-salient” 
values 


Index 


30.82 001 


45.09 001 


15.10 .02 


* Examination of the actual data plots reveals 
that all of these significant chi squares are due 
to positive, monotonic relationships, 


One possible objection to such an interpreta- 
tion might hold that the way in which 5 sorts 
the cards may reflect not his actual cognitive 
structure but rather an attempt to demonstrate 
to the experimenter and to himself that his 
beliefs are consistent with his attitudes. If 
striving for cognitive-affective consistency ac- 
counts in part for relationships obtained during 
the testing session, however, it seems appro- 
priate to assume that similar motivation had 
been in operation before the testing session, 
ie, that the motive to “rationalize” one’s atti- 
tudes plays a role in the natural history of 
attitude development. 

Still another line of evidence that limits the 
significance of this objection arises from the 


findings obtained with Index 3. This index is 
based solely upon the “value importance” and 
“perceived instrumentality” ratings of the S's 
“salient” values, which were coded out of his 
own verbal defense of his stated attitude posi- 
tion. Certainly the individual's estimates of the 
importance of these values and the ways in 
which their realization would be influenced 
by the attitude object must have had pre- 
experimental reality to have been given expres- 
sion in response to the questionnaire probe. 

It should be noted, however, that the his- 
torical relationship between affective and cog- 
nitive processes in the development of attitudes 
is not approachable through analysis of the 
present data. The province of the present study 
is to examine the relationships between cogni- 
tive and affective processes at a point in time 
after an attitude has been acquired and sta- 
bilized. Nevertheless, the present hypotheses 
may, if confirmed, suggest further hypotheses 
about the processes of attitude learning and 
attitude change. 


Hypotheses 2 and 3 


While the data just reviewed confirm the 
existence of a relationship between a measure 
of affect toward an object and a measure based 
upon both the "importance" and "perceived 
instrumentality" of values associated with it, 
the question may be raised as to whether each 
of these two latter variables may be demon- 
strated to covary with affect when the other 
is held constant. Hypotheses 2 and 3 assert 
the existence of such relationships. Indices 
5-10 are based upon procedures for obtaining 
separate estimates of the "value importance" 
and "perceived instrumentality" variables. To 
isolate the one variable, it was possible to hold 
the other constant since the original card- 
sorting tasks had required $ to rank the values 
within each of the categories used in sorting. 

Thus, to test the relationship between “pet 
ceived instrumentality” of values and affect 
toward the attitude object, four indices (58) 
were developed, all of which have in common 
being based upon a summation of the "per 
ceived instrumentality” ratings of a group of 
values defined and chosen in terms of their 
“value importance” ranks. Index 5 for example 
is obtained by choosing the value items ranked 
1-20 by the $ on the dimension of “value im- 
portance” and then obtaining the algebraic 
sum of the “perceived instrumentality” ratings 
of these same 20 values. 
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A similar procedure was employed with 
regard to the indices (9 and 10) used to test 
the relationship between "value importance" 
and affect toward the attitude object. These 
two indices are both based upon the “value 
importance" ratings of a group of values de- 
fined and chosen in terms of their "perceived 
instrumentality" ranks. Index 9 for example 
is obtained by choosing the five values ranked 
highest on "perceived positive instrumentality" 
and then obtaining the algebraic sum of the 
"importance" ratings of these same five values. 
Index 10 is identical except that it is computed 
from the five values ranked highest on "per- 
ceived negative instrumentality.” 

The results reported in Table 3 indicate that 
all four of the indices of “perceived instru- 
mentality” are significantly related to attitude 
position. Furthermore it should be noted that 
the hypothesis has been verified at a number 


TABLE 3. Relationships between “Perceived 
Instrumentality" (Indices 5 to 8) and 
Attitude Position (N = 117) 


Description Chi Proba- 
Index of Index Square — bility* 


ÉÉ—— E 

5 Algebraic sum of 35.51 .001 
the instru- 
mentality ratings 
of the values 
ranking Ist-20th 
on importance 

6 Algebraic sum of 
the instru- 
mentality ratings 
of the values 
ranking 11th-30th 
on importance 

7 Algebraic sum of 20.82 01 
the instru- 
mentality ratings 
of the values 
ranking Ist-10th 
on importance 

8 Algebraic sum of 38.47 .001 
the instru- 
mentality ratings 
of the values 
ranking 11th-20th 
on importance 


21.27 .001 


* Examination of the actual plots reveals that 
all of the significant chi squares are due to the 
presence of positive, monotonic relationships. 


of different levels of "value importance" con- 
trol, each of the four indices representing a 
different level. On the basis of these findings 
Hypothesis 2 appears to have been confirmed. 
To generalize, other things (particularly “value 
importance”) being equal, extreme attitudinal 
affects are associated with perceptions of close 
positive or negative instrumental connection 
between the attitude object and related values, 
while moderate attitudinal affects are associated 
with perceptions of less clear-cut instrumental 
relationships. In terms of attitude theory this 
finding suggests the existence of a distinguish- 
able and important dimension of attitude- 
related cognitive structures. Recent research (1) 
based upon the present study lends confirma- 
tion to the existence of this dimension as a 
separate and manipulable one and indicates 
that attitude-change effects (in the sense of 
affective change) may be achieved through its 
manipulation. 

The results reported in Table 4 indicate that 
both of the indices of “value importance" are 


TABLE 4. Relationships between "Value 
Importance" (Indices 9 and 10) and 
Attitude Position (N = 117) 


Description Chi Proba- 
Index of Index Square  bility* 


ee 

9 Algebraic sum of 12.75 .05 
the importance 
ratings of the 
values ranking 
Ist-5th on 
positive 
instrumentality 
(attainment 
of values) 

10 Algebraic sum of 
the importance 
ratings of the 
values ranking 
Ist-5th on 
negative 
instrumentality 
(blocking of 


values) 
a 


* Examination of the actual plots reveals that 
the significant chi square in the case of Index 
9 is due to the presence of a positive monotonic 
relationship, while the significant chi square in 
the case of Index 10 is due to the presence of 
a negative monotonic relationship, both as pre- 
dicted. 


16.46 .02 
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significantly related to attitude position. On 
the basis of these findings, Hypothesis 3 ap- 
pears to have been confirmed. ; 
Although the relationships reported in Table 
4 reach an acceptable level of significance, 
however, their significance level is lower than 
most of the relationships reported in Tables 2 
and 3. The present data leave it an open ques- 
tion as to whether this is an artifact of the 
measurements or computations employed or 
whether, on the other hand, “perceived instru- 
mentality” actually controls more variance in 
attitudinal affect than does “value importance.” 
The generalization is nevertheless supported 
that other things (particularly “perceived in- 
strumentality") being equal, extreme attitudinal 
affects are associated with values of high im- 
portance while moderate attitudinal affects are 
associated with values of less importance. For 
attitude theory, these results suggest the exis- 
tence of a second distinguishable and impor- 
tant dimension of attitude-related cognitive 
structures. Further research is needed to deter- 
mine whether the individual's assessment of the 
importance of his values is manipulable and 
whether, if it is, such manipulation is sufficient 
to produce change in affect toward an object 
seen as having instrumental influence upon 
the attainment or blocking of these values. 


Further Implications for Attitude Theory 


It has been suggested that attitudinal effect 
toward an object may be altered by the prior 
modification of value importance and perceived 
instrumentality, A related implication is that 
one way in which attitudes may originally de- 
velop is through the prior acquisition of beliefs 
about the value-attaining or value-blocking 
powers of particular objects, be those objects 
individuals, groups, political proposals, or com- 
mercial products. Such a view, even if it has 
not been systematically elaborated, seems to 
underlie the attitudecreating and attitude- 
changing communication techniques of many 
propagandists in such fields as advertising, 
teaching, psychological warfare, and child- 
rearing. 

A less obvious implication of the present 
findings suggests at least one other type of 
sequential process leading to the establishment 
of stabilized | affective-cognitive patterns, It 
seems plausible that much original attitudinal 
learning (the acquisition of a stable affective 
response where none existed previously) may 
originate in experiences of being rewarded or 


punished for imitation or rehearsal of expres- 
sions of affect provided by others. A case in 
point may be found in the data on the acquisi- 
tion of the anti-Negro attitude as reported by 
Horowitz (4). With an affect, or its beginnings, 
established through reinforcement procedures, 
something like a need for affective-cognitive 
consistency may set the person to acquire 
socially available beliefs that “rationalize” the 
acquired affect. When such beliefs are unavail- 
able in the person’s communicative surround, 
he may invent them. In real-life situations 
where attitudes are being instilled, there are 
generally available both cognitive supports for 
the advocated affect and direct reinforcements 
for its expression. 

Recent research by the investigator seems 
to indicate that the production of change in an 
already established attitude may be obtained 
without any direct attempt to modify the asso- 
ciated cognitive structure, but rather through 
direct assault (with hypnotic techniques) upon 
the established affect. Such assault produces 
in some Ss a temporary affective reversal, and 
along with this reversal, a spontaneous, self- 
directed modification of the related cognitive 
structure, 


SUMMARY 


An attitude questionnaire on the issue of 
“allowing members of the Communist Party to 
address the public” was administered to 117 
Ss. Three to five weeks later each S took a 
card-sorting test which required him to rate 
and rank each of a group of value items in 
terms of (a) the importance of the value as à 
source of satisfaction and (b) his perception as 
to the extent to which the value tends to be 
attained or blocked through the instrumental 
agency of the attitude object. 

The data thus gathered were used in testing 
three hypotheses concerned with the relation- 
ships between stable affective response (atti- 
tude) toward an object and beliefs about that 
object. These hypotheses were derived from à 
general proposition to the effect that the sign 
and extremity of affect toward an object are 
functions both of whether it is perceived as 
facilitating or blocking the attainment of 
values and of whether or not the values in- 
volved are important ones. All three hypoth- 
eses were confirmed. Some of the data were 
interpreted as suggesting that "value im- 
portance" and "perceived instrumentality" are 
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separate and possibly, manipulable dimensions implications of the present study with regard 


of attitude-related cognitive structures. to the phenomena of attitude learning and 
Some suggestions were presented as to the attitude change. 
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36. Attitudinal Consequences of Induced Discrepancies 
Between Cognitions and Behavior 


ARTHUR R. COHEN 


* 


Many situations in daily life involve engaging 
in disagreeable behavior or considering infor- 
mation contrary to one's own attitudes and 
values. The reasons for becoming involved in 
situations of this discrepant character do not 
concern us here. Our major purpose is to deal 
with the psychological processes which occur 
in such situations and their consequences for 
attitude change. 


THEORETICAL BACKGROUND 


It has often been demonstrated that forcing 
à person to express an opinion position dis- 
crepant from what he privately believes results 
in a change of private opinion! According to 
Festinger's analysis, expressing an opinion dis- 
crepant from one's privately held position cre- 
ates “dissonance,” or psychological tension 
having drive character. The tension may be 


* Excerpted from Public Opinion Quarterly, 1960, 
24, 297-318, with permission of the publisher. This 
paper was prepared as part of the program of the 
Research Center for Human Relations, New York 
University. Most of the research cited was carried 
out by the author and J. W. Brehm while both 
were at Yale University. 


1 H. Burdick, “The Compliant Behavior of Devi- 
ates under Conditions of Threat,” Minneapolis, 
University of Minnesota, 1955, unpublished doc- 
toral dissertation. I. L. Janis and B. T. King, "The 
Influence of Role Playing ín Opinion Change," 
Journal of Abnormal and Social Psychology, Vol. 
58, 1959, pp. 203-210. H. C. Kelman, “Attitude 
Change as a Function of Response Restriction,” 
Human Relations, Vol. 6, 1953, pp. 185-214. D. Mc 
Bride, "The Effects of Public and Private Changes 
of Opinion on Intra-group Communication," Min- 
neapolis, University of Minnesota, 1955. unpub. 
lished doctoral disertation. B. H. Raven, "Social 
Influence on and the Communication of 
Related Content," Journal of Abnormal and Social 
Psychology, Vol. 58, 1959, pp. 119-129. 


reduced by changing one's private position to 
coincide more nearly with the position ex- 
pressed.2 

While the direction of attitude change may 
thus be understood, what, according to dis- 
sonance theory, controls just how much tension 
the person experiences? Or, more formally, we 
may ask: What are the factors controlling the 
magnitude of dissonance created and therefore 
the magnitude of consequent attitude change? 
One obvious controlling factor is the impor- 
tance to the individual of the various cogni- 
tions, or bits of information, relevant to his 
behavior. Other things being equal, the more 
important to him his private opinion and what- 
ever rewards or punishments that lead him to 
engage in a discrepant behavior, the greater 
will be the resulting amount of dissonance. 
In other words, inducing a person to express 
à discrepant position on an issue that is very 
important to him will create more dissonance 
than inducing him to express a discrepant 
position on a trivial issue, 

Festinger's theory states that the cognitions 
that lead to the behavior are consonant with 
it. Those that deter the behavior are dissonant 
with it. A second, and more central, deter- 
minant of the amount of dissonance is, then, 
the ratio of dissonant to consonant cognitions. 
The greater this ratio, the greater will be the 
dissonance. In regard to a discrepant behavioral 
stand or opinion expression, the greater the 
importance and number of cognitions against 
having made it compared to the importance 
and number for it, the greater will be the dix 


2 L. Festinger, A Theory of Cognitive Dissonance, 
Evanston, Ill, Row, Peterson, 1957. The stimulation 
for the studies to be discumed comes from Fest 
inger's theoretical statement. Most of the data were 
collected to test inferences made on the basis of 
these derivations. 
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sonance. It follows that if a person has a pri- 
vate opinion on an issue and is induced to 
behave in a way which is discrepant from it, 
then the greater the number and importance 
of cognitions leading to the discrepant be- 
havior, the less will be the amount of dis- 
sonance produced. Or, to put it another way, 
the more compelling the reasons for taking the 
discrepant stand, the less will be the dissonance 
created by taking it. 

Fortunately, there are many ways in which 
the individual can reduce dissonance. He can, 
in general, change any or all of the cognitions 
so that they will lead more strongly to the ex- 
presion made. He can increase the felt 
importance of the intrinsic rewards or punish- 
ments that led to the discrepant behavior. And, 
more importantly for our interests, he can re- 
duce or completely eliminate his dissonance by 
changing his private opinion so that it is more 
consistent with the expression made. 

It is expected that the greater the amount 
of dissonance, the stronger will be the attempts 
to reduce it. Since dissonance can be reduced 
by attitude change, we should be able to pre- 
dict the amount of attitude change as a func- 
tion of the variables that control the amount 
of dissonance created. Thus, given the expres- 
sion of opinion discrepant from the position 
held privately, the more equal the balance 
between cognitions leading toward and against 
the behavior, the greater will be the magnitude 
of dissonance and consequent attitude change 
in the direction of the behavior or expression. 
Finally, we may state the major idea: The 
greater the number and/or the importance of 
the cognitions .leading to the behavior dis- 
crepant from one's private opinion, the less 
will be the dissonance created and the less 
will be the consequent amount of attitude 
change toward the expression or position repre- 
sented by the behavior. In other words, the 
more compelling the reasons for taking a pub- 
lic stand that differs from what one really be- 
lieves, the smaller will be the attitude change 
toward the expressed. position. 

We may now summarize our analysis. When- 
ever one’s cognitions about extrinsic forces lead 
to a public expression different from one’s pri- 
vate opinion, dissonance will inevitably result. 
The amount of dissonance will increase as the 
general importance of the guiding cognitions 
increases, and as the balance between the op- 
posing cognitions increases. When a person 
experiences dissonance he will try to reduce 


or eliminate it in proportion to its strength. 
In general, he may do this by making his cog- 
nitions more consistent with the expression 
already made. Thus, one major consequence 
will be a change in one's private opinion to 
make it more consistent with the position ex- 
pressed. And, finally, as the imbalance of cog- 
nitions leading to the alternate expressions 
becomes greater, the amount of dissonance and 
consequent attitude change will be smaller. 
Let us now examine some evidence relevant 
to these theoretical considerations and see 
what additional issues important to communi- 
cation and attitude change emerge. r 


RESEARCH EVIDENCE 


Motivational Determinants of Cognitive 
Dissonance and Consequent Attitude Change 
The foregoing analysis leaves some question 
as to what cognitions lead to an opinion ex- 
pression or behavior discrepant from that pri- 
vately held. One cognition is the promise of 
a reward or prize with monetary value. If the 
value of such a prize is meaningful, people can 
be induced to behave publicly in a way that is 
inconsistent with what they privately believe. 
In all the previously cited experiments, in 
which persons were induced to express posi- 
tions discrepant from their private opinions, 
opinion change in the direction of the induced 
position was found. More specific evidence 
comes from the experiment by Kelman in 
which he induced children to write essays dis- 
crepant from their private opinions? In one 
of his conditions everyone who wrote a dis- 
crepant essay was to get a prize, while in the 
other only the few best essays would earn a 
prize. Festinger has pointed out that certainty 
of getting a prize is a more compelling reason 
for compliance than is the uncertainty of a 
competitive situation and should consequently 
produce less dissonance and resultant opinion 
change.4 Kelman does indeed report less opin- 
ion change where his subjects were certain of 
getting the prize, but he and others have 
maintained that the opinion change he ob- 
tained, as well as that in the other previously 
cited experiments, may result simply from a per- 
son's persuading himself, much as a communi- 
cator might. In support of this explanation, the 
average essay strength, in terms of number of 


3 Kelman, op. cit. 
4 Festinger, op. cit. 
5 Janis and King, op. cit. 
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arguments, in Kelman’s experiment was greater 
in the condition that produced the most opin- 
ion change. What was needed, therefore, was 
an unequivocal test of each explanation. 

An experiment by Brehm, generally pat- 
terned along the lines of Kelman's experiment, 
appears to give clear support to the dissonance 
theory analysis.6 Junior high school students 
were asked to indicate on a questionnaire how 
they would like it if summer vacations were 
shortened to eight weeks, five weeks, or elimi- 
_ nated altogether. To each of these questions 
they were to check one answer of a seven-point 
scale ranging from, "I'm very much in favor 
of it,” to "I'm very much against it." Though 
this questionnaire required their names, the 
essentially anonymous nature of it was stressed. 

One week later the actual experiment was 
conducted. Four experimental conditions were 
created. The first was a control in which sub- 
jects were simply asked to fill out the question- 
naires again, this time without putting their 
names on them. There was, of course, a method 
of identifying them. In the other three con- 
ditions the experimenter said he wanted to 
know what reasons the students might have 
for wanting shorter vacations. He said he was 
therefore willing to give a prize to anyone 
who would write a strong essay in favor of 
shorter vacations. In order to vary the imbal- 
ance of cognitions leading to and from com- 
pliance with the request, the magnitude of the 
prize value was varied. In the low-value con- 
dition, subjects could have any one of the fol- 
lowing: a phonograph record, a movie ticket, 
Or a ticket to a skating rink; in the medium- 
value condition they could have two of these: 
and in the high-value condition they could 
have four. Anyone who wanted could write in 
favor of keeping vacations long, but he would 
not get a prize. After the subjects completed 
their essays, they filled out the questionnaires 
again, without signing them. 

In short, then, students were induced to 
write statements against their private opinions, 
in one case for an attractive prize, in a second, 
for a very attractive prize, and in a third, for 


9 The theoretical account in the previous section 
and this brief description of Brehm's experiment 
on forced verbal compliance were adapted from a 
speech given by Brehm at the 1957 meeting of the 
Amerícan Association for Public Opinion Rescarch, 
held in Washington, D.C. (J. W. Brehm, "Forced 
Verbal Compliance and Attitude Change," New 
Haven, Conn., Yale University, 1957, unpublished 
manuscript.) 


an extremely attractive prize. Their private 
attitudes on the issue, their judgment about 
the fairness of giving prizes for compliance, 
and their perception of experimenter friendli- 
ness were measured before and after the in- 
duced behavior. 

We might expect, according to dissonance 
theory, that, as the prize value increases, one's 
private opinion becomes relatively less im- 
portant, thereby creating less and less dis- 
sonance. The results indicate this is indeed 
what happens. Persons complying for the high- 
value prize show less attitude change in the 
direction of compliance than do those in the 
medium- or low-value conditions. 

It should be noticed that this effect is diff- 
cult to explain in terms of verbal self- 
persuasion. The analysis of essay strength in 
terms of number of words and arguments per 
essay indicates that as prize value increases 
there is a very slight, generally nonsignificant, 
trend for essay strength to increase. Thus this 
small trend for essay strength to increase is 
accompanied by less, rather than more, opinion 
change.7 

Another experiment by Festinger and Carl- 
smith was designed to test similar derivations 
from the theory.8 Subjects were given a very 
boring experience and then paid to tell some- 
one that the experience had been interesting 
and enjoyable. After varying amounts of 
money were paid to the subjects, their pri- 
vate opinions concerning the experience were 
determined. The results show that if a person 
is induced to say something contrary to his 
private opinion, the larger the financial in- 
ducement, the weaker will be the tendency to 
change his opinion to correspond with what 
he has said. 

Other research by Cohen, Brehm, and Flem- 
ing has shown that another type of inducing 
force has consequences for dissonance and 
attitude change.® College students were re- 


7 In general, however, where inducement varies, an 
attempt is made through experimental controls to 
keep essay strength constant. This makes it difficult 
to interpret the results in terms of self-persuasion 
or differing involvement. 

8 L. Festinger and J. M. Carlsmith, "Cognitive Con- 
sequences of Forced Compliance," Journal of Ab- 
normal and Social Psychology, Vol. 58, 1959, pp. 
203-210. 

9 A. R. Cohen, J. W. Brehm, and W. H. Fleming, 
"Attitude Change and Justification for Compliance, 
Journal of Abnormal and Social Psychology, Vol. 
56, 1958, pp. 276-278. 
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quested to write essays supporting a position 
opposite to their own on a current-event issue. 
In one condition, they were given minimal rea- 
sons for writing the discrepant essay, while in 
another they were given numerous reasons, 
for example, that it would help the experi- 
menter and social science, that everyone should 
cooperate, and that the college authorities were 
interested. The assumption was that each argu- 
ment would add a compelling reason for per- 
forming the discrepant act so that the inducing 
force would be greater in the high-justification 
condition than in the low-justification con- 
dition. As expected, greater dissonance and 
consequent attitude change toward the dis- 
crepant position occurred in the low-justifica- 
tion condition. A further study by Rabbie, 
Brehm, and Cohen provides similar evidence 
concerning the inverse relationship between 
justification for taking a discrepant stand and 
attitude change toward that stand.!9 

A further derivation from the theory con- 
cerns the amount of discrepant behavior en- 
gaged in. It would seem that, other factors 
held constant, the amount of dissonance cre- 
ated will be a function of the amount of dis- 
liked behavior engaged in. Dissonance may be 
reduced by decreasing the magnitude of dislike 
(or increasing the magnitude of liking) for the 
behavior. 

'The experiments mentioned above were con- 
cerned with verbal processes. Another experi- 
ment, by Brehm, was designed to test some 
predictions concerning attitude change as a 
function of overt behavioral compliance." 
Eighth-grade boys and girls were induced to 
eat a vegetable they had previously indicated 
they heartily disliked. For those who were told 
that the vegetable had low food value, the more 
they had to eat the more they liked it. Thus 
the dissonance engendered by choosing to eat 
a disliked vegetable can be reduced or elimi- 
nated by making one’s attitude more consistent 
with one’s behavior. 

Taken together, these results clearly support 
the applicability of dissonance theory to this 
problem. When an individual is induced to 
make a public stand different from what he 


10 J. M. Rabbie, J. W. Brehm, and A. R. Cohen, 
“Verbalization and Reactions to Cognitive Disso- 
nance,” Journal of Personality, Vol. 27, 1959, pp. 
407-417. 

11 J. W. Brehm, “Attitudinal Consequences of Com- 
mitment to Unpleasant Behavior,” Journal of Ab- 
normal and Social Psychology, in press. 


privately believes, dissonance or psychological 
tension will result. This tension will tend to 
be great where the balance between the oppos- 
ing cognitions is about equal. The more com- 
pelling the forces leading to the discrepant 
expression, the less tension will result from 
making the discrepant expression. Thus increas- 
ing rewards or justifications for taking a dis- 
crepant stand will lead to decreasing dissonance; 
decreasing the motivation (beyond that neces- 
sary to elicit the behavior) will lead to greater 
dissonance. In any case, whenever the tension 
does exist, it can be relieved by making one's 
private opinion more consistent with the view 
expressed; the greater the dissonance the 
greater the attitude change. 


The Importance of Choice 

The assumptions presented above point to 
phenomena of post-decision behavior, to what 
happens to the person's attitude after he has 
chosen to comply with the request to engage 
in discrepant behavior. It should therefore be 
apparent that dissonance theory has explicit 
implications for situations of "free choice," 
where the person chooses between alternatives. 
According to this theory, all cognitive elements 
that favor the chosen alternative are consonant 
and all cognitive elements that favor the un- 
chosen alternative are dissonant. Other things 
being equal the greater the number of ele- 
ments favoring the unchosen alternative (the 
greater its relative attractiveness), the greater 
the resulting dissonance. Dissonance can be 
reduced by making the chosen alternative more 
desirable and the unchosen alternative less 
desirable after choice than they were before. 

Brehm and Brehm and Cohen have shown 
that an increase in the magnitude of disso- 
nance and consequent pressure to reduce it by 
increasing the relative attractiveness of the 
chosen alternative depend at least upon (1) 
an increase in the degree to which the alterna- 
tives approach equal desirability, (2) a greater 
number of alternatives, and (3) greater quali- 
tative dissimilarity or lack of cognitive overlap 
among alternatives.12 

Given the centrality of the act of choice in 


12 J. W. Brehm, "Post-decision Changes in the De- 
sirability of Alternatives," Journal of Abnormal 
and Social Psychology, Vol. 52, 1956, pp. 384-389; 
J. W. Brehm and A. R. Cohen, "Re-evaluation of 
Choice Alternatives as a Function of Their Num- 
ber and Qualitative Similarity,” Journal of Abnor- 
mal and Social Psychology, Vol. 58, 1959, pp. 373- 
378. 
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either selecting an alternative or complying to 
an induced discrepant behavior, the question 
arises as to the importance of the act of volition 
for the creation of cognitive dissonance. The 
theory certainly implies that a person who is 
forced to behave in a manner he would avoid 
if he could experiences no dissonance. In this 
case maximum force would be motivationally 
equivalent to maximum financial inducement, 
and both should produce little consequent atti- 
tude change. On the other hand, a fait accompli, 
ie. an event beyond the person's control, might 
conceivably produce dissonance if the same 
event would have led to the opposite behavior 
had it been predictable at the choice point. 

Brehm reports an experiment in which a 
fait accompli does appear to have increased 
cognitive dissonance.18 The previously men- 
tioned study on behavioral compliance pro- 
vided a framework for testing this hypothesis. 
The theoretical expectations were that if a per- 
son is induced by the promise of a small prize 
to eat a disliked food he should experience 
dissonance. A person who is experiencing dis- 
sonance should try, in proportion to its magni- 
tude, to reduce it by convincing himself that 
he likes the food more than he originally 
thought. If, then, after the initial decision to 
eat has occurred, a fait accompli shows him 
that he must expect to eat more, and this is 
accompanied by increased liking, it may be 
concluded that a fait accompli can affect 
the magnitude of dissonance. Thus, after the 
eighth-grade children were induced to eat the 
disliked vegetable, some were additionally told, 
while eating, that their parents would learn 
what vegetable they had eaten, with the impli- 
cation that they would have to eat more at 
home; others were told nothing further. A 
questionnaire measured liking before and after 
the experiment. It was found that the fait ac- 
compli of implied further eating produced a 
greater increase in attitude change toward lik- 
ing the vegetable. 

Thus, Brehm's experiment demonstrates that 
a chance event can affect the magnitude of dis- 
sonance. However, since choice was not varied 
it is not clear whether the effects of such an 
event depend upon there having been a prior 
relevant choice. Brehm and Cohen report an 
experiment designed to explore some of these 


13 J. W, Brehm, “Increasing Cognitive Dissonance 
by a Fait Accompli,” Journal of Abnormal and So- 
cial Psychology, Vol. 58, 1959, pp. 379-382. 


limiting conditions under which dissonance, 
and consequent attitude change, is produced.14 
In their experiment, perceived choice in sub- 
mitting to an unpleasant event and a chance 
relative deprivation introduced after commit- 
ment were varied. Subjects were asked to and 
did commit themselves to a very disagreeable 
and time-consuming project. The results indi- 
cated that, under conditions of low perceived 
choice, the more relative deprivation they suf- 
fered, the more negative were their attitudes 
toward the project. The authors interpret this 
as a straightforward effect of motivation. How- 
ever, under conditions of high perceived choice, 
the more relative deprivation the subjects suf- 
fered, the more positive were their attitudes 
toward the event. The latter effect is a clear 
expectation from dissonance theory: A person 
who chooses to behave in a way he would ordi- 
narily avoid experiences dissonance; the more 
unpleasant the induced behavior, the greater 
the magnitude of dissonance and the greater 
the consequent attitude change in order to re- 
duce it. Thus, a chance event or a fait ac- 
compli may be seen to affect the magnitude 
of dissonance in taking a discrepant position, 
but presumably only under conditions of high 
prior choice. 

A further experiment by Cohen, Terry, and 
Jones also demonstrates the importance of 
choice.!5 It shows that at least some prior deci- 
sion in which the person commits himself to 
a general negative situation is necessary for 
the creation of dissonance. In this experiment, 
choice in exposure to counterpropaganda was 
varied and the person's opinion on a salient 
attitude dimension was measured. Half the sub- 
jects who were extreme in their opinions and 
half who were moderate were exposed to in- 
formation which they knew to be counter to 
their positions; the other halves of these groups 
were given a choice about exposing themselves 
to the contrary information, After exposure, 
their opinions were measured again. The re 
sults show that under conditions of low choice 
a direct resistance effect was obtained: the 
greater the discrepancy between the propa- 


14 J. W. Brehm and A. R. Cohen, "Choice and 
Chance Relative Deprivation as Determinants of 
Cognitive Dissonance,” Journal of Abnormal and 
Social Prychology, Vol. 58, 1959, pp. 383-387. 

18 A. R. Cohen, H. I. Terry, and C. B. Jones, “Atti- 
tudinal Effects of Choice in Exposure to Counter- 


propaganda," Jowrmal of Abnormal and Social 
Psychology, Vol. 58, 1959, pp. 388-391. 
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ganda and their initial opinion, the less the 
attitude change. However, under conditions of 
high choice, expectations from dissonance the- 
ory were fulfilled: Having chosen to behave in 
a discrepant fashion (chosen to expose them- 
selves to contrary information), the greater the 
discrepancy between the information and their 
initial opinion the greater the dissonance and 
consequent attitude change. 

An interesting experiment by Davis and 
Jones also illustrates the importance of prior 
choice in creating dissonance.!9 In this experi- 
ment, subjects changed their attitudes to be- 
come more negative toward a stimulus person, 
after publicly making a negative evaluation of 
him, only when they were given some choice 
in engaging in the behavior and assumed that 
their behavior was irrevocable (i.e. they did not 
anticipate explaining the behavioral deception 
to the stimulus person). 

It should be clear that, in all these experi- 
ments, though choice is manipulated, subjects 
do not select themselves in complying or not 
complying. Although the subject perceives that 
he has a great deal of choice, he is not per- 
mitted to escape from the experimental situa- 
tion. This eliminates the bias of self-selection. 
All subjects comply, even though, depending 
upon the experimental condition to which 
they are assigned, they may differ in their per- 
ceptions of choice in compliance. This issue 
will be discussed more fully later. 

In any case, these experiments indicate that 
the perception of commitment to a choice may 
be a necessary precondition for the creation of 
cognitive dissonance. Where choice is varied, 
expectations from dissonance theory are ful- 
filled only under high-choice conditions; under 
low-choice conditions, straightforward motiva- 
tional or resistance effects seem to account for 
the results. These findings imply that the mech- 
anism through which variations in motivational 
inducements are operating in producing varia- 
tions in attitude change may conceivably be 
the person's subjective perception of the choice 
he has in submitting or exposing himself to 
the undesirable event. Thus, it is possible that 
large rewards give the person the feeling that 
he cannot turn down the request. Small re 
wards, just over the threshold necessary to in- 


16 K. E. Davis and E. E. Jones, "Changes in Inter- 
personal Perception as a Means of Reducing Cog- 
nitive Dissonance,” Journal of Abnormal and Social 


Psychology, in press. 


duce the discrepant behavior, may raise the 
subject's subjective perception of choice, i.e. 
raise the issue of his own control over the situ- 
ation. Having chosen to take the discrepant 
stand, for whatever reason, there is dissonance 
between the event and his own actions in com- 
mitting himself to it. The less the reward when 
he complies, perhaps the greater the subjective 
perception of choice, and the greater the conse- 
quent dissonance and attitude change. 

These speculations are based only upon the 
most general inferences from the research dis- 
cussed. To be more certain about these no- 
tions, it would be necessary to demonstrate 
two relationships: that variations in reward do 
lead to variations in subjective perception of 
choice; and that variations in choice do lead 
to variations in attitude change. 

An experiment by Cohen and Brehm deals 
with this issue by studying the effects of coer- 
cive inductions.!? It is clear from the data that 
variations in coercion (as means of inducing 
compliance in taking a discrepant stand) do 
lead to variations in perceived choice and atti- 
tude change. But the fact that coercion has 
complex effects depending upon the perceived 
reasonableness of the situation and the per- 
ceived legitimacy of the coercive agent made 
worthwhile a further but more simple demon- 
stration using positive inducements. 

On this point, Cohen performed two pilot 
experiments which are related in that they are 
both concerned with the variable of perceived 
choice in situations where persons comply to 
undesirable events.18 In the first experiment, 
reward was manipulated, the subject took a dis- 
crepant stand by writing an essay against his 
own position on a salient issue, and his per- 
ceived choice as well as his attitude change on 
the issue were measured. In the second experi- 
ment, choice was manipulated directly, the sub- 
ject took a discrepant stand by making a re- 
corded speech against his position on another 
issue, and his attitude change was measured. 
The data tentatively confirm the above hypoth- 
eses. They indicate that the lower the financial 
inducements for taking the discrepant position, 
the greater may be the subjective perception 


17 A. R. Cohen and J. W. Brehm, “Coercion, 
Choice and Attitude Change,” Yale University. 
1958, unpublished manuscript. 

18 A. R. Cohen, “Forced Compliance through Posi- 
tive Inducements, Perceived Choice and Attitude 
Change,” New York University, 1959, unpublished 
manuscript, 
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of choice, and the greater the perception of 
choice in taking the discrepant stand, the 
greater may be the attitude change toward the 
discrepant position. 

Thus it would seem that a good many of the 
experiments in which dissonance has been pro- 
duced and studied can be integrated under the 
general rubric of situations in which there is 
subjective perception of choice. Examples of 
such experiments are: free-choice situations in 
which subjects choose between alternatives;!9 
forced-compliance studies like those of Brehm, 
Festinger and Carlsmith, and Cohen, where re- 
wards have been used;20 experiments where 
deprivations and coercions have been manipu- 
lated to produce compliance;?! studies where 
justification inducements have been used;22 and 
studies in which choice in exposure to counter- 
propaganda has been manipulated.23 The use 
of such a rubric might make possible more gen- 
eral theoretical statements. While the dissonance 
formulation as it stands deals with bundles of 
cognitions in opposition and is conceptualized 
at a very abstract level, a more adequate formu- 
lation might point to the conflict within the 
person as different aspects of his self become 
embroiled in opposition as a result of a choice 
or commitment on his part. 


THEORETICAL STATUS OF DISSONANCE 
RESEARCH: PROBLEMS AND LIMITATIONS 


The research mentioned above illustrates the 
power of the dissonance formulation to explain 
attitudinal effects of commitment to discrepant 
and other positions which produce dissonance. 
However, a number of problems should cause 
us to accept the general case with some caution. 
In the present section some of these problems 
and limitations will be briefly mentioned. 


The Specificity of Decision 
One limitation in the present research centers 
around the necessity in the dissonance formu- 


19 Brehm, “Post-decision Changes in the Desira- 
bility of Alternatives"; Brehm and Cohen, "Re- 
evaluation of Choice Alternatives." 

20 Brehm, “Forced Verbal Compliance and Attitude 
Change"; Festinger and Carlsmith, op. cit.; Cohen, 
“Forced Compliance through Positive Inducements, 
Perceived Choice and Attitude Change.” 

21 Brehm and Cohen, “Choice and Chance Relative 
Deprivation”; Cohen and Brehm, “Coercion, Choice 
and Attitude Change.” 

22 Cohen, Brehm, and Fleming, op. cit.; Rabbie, 
Brehm, and Cohen, op. cit. 

23 Cohen, Terry, and Jones, op. cit. 


lation to be pro or con and to take a clear 
stand on a contrary issue. While dissonance 
theory may work well where there are such 
clear-cut alternatives, it may not be so helpful 
where compromise positions may be taken. It 
seems reasonable that dissonance reduction via 
attitude change is dependent upon the indi- 
vidual's decision to commit himself to a dis- 
crepant position. 


The Psychology of Choice 


A related but more serious question concerns 
the theoretical status of choice. It is often not 
clear from the theory and experimental work 
just what creates the basic state of tension or 
dissonance. It is, for example, the discrepancy 
between a prior attitude and an undesirable 
behavior which creates dissonance or the dis- 
crepancy between a prior attitude and the in- 
dividual's feelings of responsibility for having 
chosen to do something contrary to his cogni- 
tions? These issues raise the question of whether 
it is necessary to extend the conceptualization 
of dissonance into the sphere of conflicts within 
the individual as his selffeelings become en- 
gaged after having initiated behavior which is 
contrary to his cognitions. Another relevant 
question concerns the possible similarities and 
distinctions between the psychological pro- 
cesses involved in choice and those involved in 
commitment. Most generally, these issues con- 
cern more rigorous specification of just "what" 
the individual is choosing to do when he agrees, 
for example, to comply with or commit himself 
to a discrepant event. 


Conflict versus Dissonance 


Another difficult problem concerns the dual 
theoretical notions of conflict and dissonance. 
With regard to the free-choice situations, à 
conflict interpretation might claim that re- 
evaluation of alternatives occurs before choice, 
thereby changing the approach-avoidance gra- 
dients for the alternatives, and allowing finer 
discriminations so that a choice can be made. 
While both predecisional re-evaluation before 
choice owing to resolution of conflict and post- 
decisional re-evaluation after choice owing to 
resolution of dissonance undoubtedly occur, it 
is not clear to which process and therefore to 
which theoretical notion the major share of re- 
evaluation may be attributed. More realistically, 
it is at least important to specify the conditions 
under which one or the other process might be 
expected to occur. 
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It is also possible to attempt an‘ interpreta- 
tion of the forced-compliance studies through 
the use of conflict theory assumptions. How- 
ever, this would involve a coordination of the 
conceptual variables of conflict theory to those 
of dissonance theory. For example, as Brehm 
indicates, the dependent variables of attitude 
change would have to be directly related to the 
approach and avoidance gradients using rein- 
forcement principles or they would have to be 
related to an assumed intermediate motive that 
might be called “conflict tension." In any 
case, the task is a difficult one primarily be- 
cause there exists no one conflict theory and 
because what exists has not laid the ground- 
work for the application of reinforcement prin- 
ciples to the effects of choice among alternatives. 
Nevertheless, this would appear to be an area 
for future crucial theory and experimentation. 


Alternative Methods of Dissonance Reduction 


While the theory predicts attitude change 
because of its assumption that behavior is rela- 
tively fixed and attitudes relatively flexible, it 
does not specify which of several methods of 
dissonance reduction other than attitude change 
might be expected in a given instance. In most 
of the studies cited, persons were experimen- 
tally prevented from dissociating themselves 
from their remarks, from repudiating their be- 
havior, from leaving the field, from making 
negative attributions to others, from repudiat- 
ing the experimenter, and so forth. (Zimbardo's 
paper illustrates the problems involved in clos- 
ing off other avenues of dissonance reduction.25) 
However, since these are all devices for reduc- 
ing the tension aroused by discrepant behaviors, 
specification of their comparative utilization de- 
pending upon the individual and upon situa- 
tional limitations should be more explicitly 
built into the theory. Thus, for example, it 
might be possible to relate given ego-defensive 
behavior to a given method of dissonance re- 
duction, as characteristic individual reactions to 
tension or conflict in discrepant situations. A 
recent experiment by Brock has made a begin- 
ning along these general lines: it deals with 
the restructuring of cognitive organization as 
an alternative to dissonance reduction through 
attitude change.?6 


24 Brehm, "Attitudinal Consequences of Commit- 
ment to Unpleasant Behavior." 

25 Zimbardo, op. cit. 
26 T. Brock, “The Role of Cognitive Restructuring 


Creating the Conditions for Choice 


Still another problem concerns the experi- 
mental fostering of the individual's belief that 
he has relative freedom of choice about whether 
or not he complies with the discrepant request. 
If the person feels that he has been coerced, 
little dissonance and consequent attitude 
change will result. On the other hand, if sub- 
jects are actually allowed to agree or disagree 
freely with the request to comply, few may 
comply. After all, why should anyone agree to 
support a position with which he disagrees or 
engage in behavior which is unpleasant to him? 
To the degree that some people are allowed to 
refuse and some to comply, self-selection be- 
comes a serious limitation of the experiment. 
Therefore, a great deal of experimental inge- 
nuity must be exercised if the subject is to be 
made to feel that the choice is his, while at the 
same time the experimenter must be careful not 
to allow him to slip out of the experiment. 
Depending upon the experimental situation,?7 
different procedures have been adopted to cope 
with this problem. 


Definitions and Conceptualizations 


For the most part, the above studies have 
made the most general assumptions about the 
conceptualization of cognition, attitude, opin- 
ion, choice, and so forth. Emphasis has usually 
been placed on the prior conditions of disso- 
nance creation and their effects upon a verbal 
statement of opinion. The task of defining atti- 
tude as distinct from opinion, or the explicit 
separation of public from private opinion, has 
not been taken up here. Obviously, this imposes 
a limitation on the present research which, it 
is hoped, future work on communication and 
attitude change will correct. 


Alternative Explanations 


In a number of the studies mentioned, there 
are particular problems of interpretation. For 
example, in the “effort” and “choice in expo- 
sure” experiments, communication discrepancy 
was effected not through experimental manipu- 
lation but through premeasure of the subjects’ 
attitudes. Because of this, it is possible to ascribe 
the effects in the high-choice and high-effort 


in the Attitude Change Process," New Haven, Yale 
University, 1960, unpublished doctoral dissertation. 
27 E.g. Festinger and Carlsmith, op. cit., and Cohen, 
“Forced Compliance through Positive Inducements, 
Perceived Choice and Attitude Change.” 
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conditions not to dissonance but, for example, 
to an acquiescence effect. Thus, subjects in 
these conditions could have been disposed to 
take the counterpropaganda more seriously be- 
cause they had chosen to expose themselves to 
it or felt that they had expended effort to un- 
derstand it. They would then change more, the 
more change possibilities were inherent in their 
initial scale positions. A more satisfactory pro- 
cedure, which would close off such an alter- 
native explanation, would be to manipulate 
communication discrepancy through presenta- 
tion of contrary communications of different 
strengths to subjects at all initial scale positions. 


It is particularly tempting to concoct “per- 
sonality explanations” for the results presented 
in the foregoing pages. Such reinterpretations 
could center around factors like "defensive ra- 
tionalizations of conflicts occasioned by com- 
pliance" or "common personality sources of 
discrepant expression and various forms of atti- 
tude change." The difficulties with such alter- 
native explanations are: 


l. They are often post hoc; in such cases a 
reconstruction can be attempted only when a 
behavioral datum is given beforehand. It is 
usually difficult to point to any given behavior 
in advance as relevant or interesting or pre- 
dictable. 

2. Such explanations usually focus on re- 
stricted aspects of the experimental situation, 
often ignoring the situational factors which 
place demands and limits upon the individual. 

3. They are nonspecific, in that they can 
usually be applied to most experimental situ- 
ations, without necessarily distinguishing from 
other situations those typically dissonance- 
producing situations in which post-decision be- 
havior is treated. 

4. They usually involve different kinds of 
explanation for each particular experiment, so 
that an interpretation involving a given de- 
fense may be applied to one experiment, a dif- 
ferent defense applied to another, a given 
conflict to a third, and hostility, for example, 
to a fourth. 


5. The personality mechanisms isolated are 


not necessarily coordinate to the dissonance sit- 
uation. Why, for example, should a “randomly 
chosen” psychosexual conflict or ego defense 
necessarily have implications for reactions to 
discrepant situations? The personality construct 
studied ought to be tailored to the dissonance- 
producing situation. Thus the selection of some 
notion like “need for cognitive consistency,” 
for example, where the cue-arousing properties 
of the situation are especially relevant for the 
internal characteristic, might increase explana- 
tory power. 


In any case, it should be said that though 
there may be problems of interpretation in any 
experiment, the results of all experiments tend 
to be consistent with each other and with the 
theoretical model from which they were derived. 
The virtue of the dissonance formulation is 
that the different experiments are derived from 
and assimilated to a fairly general body of 
propositions. 


Tolerance for Dissonance 


The discussion above does not imply that 
“personality” explanations are valueless and 
irrelevant to the present discussion, One can 
isolate personality constructs which may be the- 
oretically coordinate to the dissonance phenom- 
ena and particularly appropriate to study in 
interaction with them, rather than select an 
“arbitrary” construct from “general personality 
theory.” Festinger, for example, has discussed 
the variable of what he calls “tolerance for 
dissonance," a personality variable which he 
feels has implications for behavior in disso- 
nance-producing situations.*8 Individuals might 
be expected to differ in the magnitude of a 
given reaction like attitude change depending 
upon how much dissonance was created in them 
by a forced-compliance situation. The experr 
ments discussed here have not studied indi- 
vidual differences in the degree to which 
dissonance is experienced in a given dissonance- 
producing situation. However, specification of 
these different personality styles might sharpen 
predictions about behavior in such situations. 


28 Festinger, op. cit., pp. 266-271. 


37. Symbolic Psycho-Logic: A Model of Attitudinal Cognition 


ROBERT P. ABELSON and MILTON J. ROSENBERG 


Several lines of theoretical and research inter- 
est have gradually been converging upon the 
study of “cognitive structure,” especially in the 
area of social attitudes. Attitudes, by the defi- 
nitions of Smith, Bruner and White (15), Peak 
(12), Rosenberg (14), Green (6) and many others, 
involve both affective and cognitive components. 
These components interact intimately with one 
another, so that cognitions about attitudinal 
objects are not felt to be meaningfully analyz- 
able without consideration of affective forces. 
Theorists are reluctant even to consider cogni- 
tive units of an attitude apart from other cog- 
nitive units, preferring to treat cognition as 
“structured” into meaningful wholes (10, 16). 
Krech and Crutchfield (9, p. 152) define attitude 
globally as "an enduring organization of moti- 
vational, emotional, perceptual and cognitive 
processes with respect to some aspect of the in- 
dividual's world.” They state (p. 108) “Each of 
our perceptions does not lead ‘a life of its own’ 
but is embedded in an organization of other 
perceptions—the whole making up a specific 
cognitive structure.” Smith, Bruner, and White 
(15, p. 286), in discussing attitude measurement, 
caution, “Recent advances in the theory and 
technique of attitude measurement . . - have 
yet to take this complexity [of opinions] into 
account, Characteristically, they have conceived 
of attitudes as a matter of pro-ness and con- 
ness. . . . We would question whether it is 
reasonable to expect them to generate the di- 
mensions required for an adequate description 
of opinion. [An adequate description] comes 
rather from intensive explorations. . . .” Asch 
(2) is another author who has employed a ho- 
listic concept of attitude structure. He seeks to 
explain certain attitude change phenomena in 
terms of the “restructuring” of the attitudinal 
object (3). Rokeach (13) treats structure as an 


* Excerpted from Behavioral Science, 1958, 3, 1-8, 
with permission of the senior author and the pub- 
lisher. 


individual difference variable of cognitive style, 
defining one extreme, dogmatism, as "a rela- 
tively closed cognitive organization of beliefs 
and disbeliefs about reality . . . which .. . 
provides a framework for patterns of intoler- 
ance and qualified tolerance toward others." 

This Gestalt emphasis, motivated by the 
desire to construct an accurate model of the 
phenomenal organization of attitudes, has un- 
derscored the difficulties inherent in attempts 
to deal analytically with a set of cognitive units 
or elements. Nevertheless, if psychologists are 
to talk about "cognitive structure," they must 
do something about it; there must be a cor- 
respondence between theory and some sort of 
research operation. At least two important re- 
cent theoretical developments, Festinger's (5) 
“dissonance theory," and Heider's (7) "theory 
of cognitive balance," deal conceptually with 
the effects of organizing forces and affective 
forces upon cognitive elements. Both theories 
are concerned with changes in cognitive struc- 
ture. Festinger's theory deals mainly with in- 
consistencies between belief and action, and 
attempts to specify certain circumstances under 
which there will be more or less change in be- 
lief as an outgrowth of cognitive "dissonance," 
due to such inconsistencies. The research oper- 
ations which are used to test Festinger's theory 
are not addressed to the details of cognitive 
structure, but rather to single-response predic- 
tions derivative from the theory. Heider's theory 
is addressed to structural details of cognition, 
but there has been no research operation avail- 
able to investigate these details with particular 
attitude contents. On the other hand, the re- 
cently proposed methodological devices of Ro- 
senberg (14) and Abelson (1) are not completely 
anchored in a general theory of attitudes. A 
fresh approach to the measurement of cognitive 
structure is clearly called for. 

This paper proposes one such approach. We 
have drawn some inspiration from Cartwright 
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and Harary's (4) objective system for the exam- 
ination of "structural balance," from Heider's 
(7) ideas on cognitive balance, and are also 
intellectually indebted to Zajonc (17), who has 
suggested clever procedural devices in eliciting 
cognitive material. Otherwise, the system to be 
proposed is our own, albeit with a strong his- 
torical tether. The paper is divided into two 
parts: the "Psychological Model" and the 
"Mathematical System." 


THE PSYCHOLOGICAL MODEL 


Cognitive Elements 


Human thought, for all its complexity and 
nuance, must involve some cognitive represen- 
tation of "things," concrete and abstract. These 
things or concepts are the elements of our sys- 
tem. Though it does not seem absolutely neces- 
sary, we shall for convenience assume that 
individuals can attach some sore of verbal labels 
to the elements of their thinking. We will refer, 
then, to cognitive elements by verbal labels. 

We may distinguish three broad classes of 
elements: Actors, Means (Actions, Intrumental- 
ities), and Ends (Outcomes, Values). These 
classes are not completely exhaustive nor mu- 
tually exclusive. We propose this classification 
merely to suggest the variety of possible cogni- 
tive elements. To exemplify each class, consider 


the illustrative issue: “Having an honor system 
at Yale.” The following elements were among 
those frequently elicited from Yale students in 
a pilot study. 


Actors: myself, the Faculty, the Administra- 
tion, the Student Body, a certain minority of 
students, the honest student. . . . 

Means: the honor system, other honor sys- 
tems, the present examination system, cheating, 
reporting those who cheat, social pressure from 
other students. . . . 

Ends: the feeling of being trusted, mature 
moral standards, loyalty to friends, the Uni- 
versity's reputation, having wellrun examina- 
tions... . 


Note that all elements are expressed here sub- 
stantively to reflect their "thing-like" character. 


Cognitive Relations 


Of all the conceivable relations between cog- 
nitive elements, we choose to consider only 
four: positive, negative, null, and ambivalent. 
These will be denoted p, n, o, and a. (See also 
“Mathematical System.") We again conveniently 
assume that cognitive materials of a relational 
sort can be verbally labeled. Some typical ex: 
amples of relations between elements are shown 
in Table 1. Ambivalent relations (a) are defined 


TABLE 1. 
Positive (p) Negative (n) Null (0) 
Actor likes; dislikes; is indiffer- 
supports fights ent to 
Means uses; opposes; is not re- 
Actor vs. advocates undermines sponsible 
for 
End possesses; inhibits; is indiffer- 
aims for aims against ent to 
Actor helps; hinders; does not 
promotes insults affect 
Means is equiva- is alternative is unrelated 
Means vs. lent to to; counter- to 
acts 
End brings about prevents does not 
lead to 
Actor serves; is is inimical to does not 
vital to interest 
End vs. Means justifies obviates cannot en- 
sue from 
End is consistent is incom- is uncon- 
with patible with nected to 
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as conjunctions of positive and negative rela- 
tions; they are psychologically secondary or 
"derived," and have been omitted from the 
table. 

Note that relations are expressed here as 
verbs. 

Certain omissions from the above scheme 
may occur to the reader. Relations like "is next 
in line to," "is north of," etc. are certainly not 
included. Other phrases which imply relation- 
ship without specifying its sign, eg, “depends 
upon," "is connected with," etc. are difficult 
to classify. Furthermore, notions of time sense, 
moral imperative, and conditionality or prob- 
ability, e.g. “might have acted upon,” "prob- 
ably will help," "ought to seek," etc, are not 
dealt with explicitly. Time sense and condition- 
ality are beyond the scope of the present treat- 
ment. We deal here only with cognitions of the 
psychological present, and temporarily set aside 
the past, conditional past, future, and condi- 
tional future. As for the other exceptions to 
the scheme, note that their phrasing tends to 
be affectless. Dispassionate descriptions of, say, 
the mating habits of flamingoes or the opera- 
tion of a nuclear reactor or the economic situ- 
ation in Viet Nam would probably contain 
many relations impossible to classify. But such 
descriptions are reportorial, not attitudinal. 
When cognition is invested with affect, when 
the Actors, Means, and Ends are responded to 
emotionally, then the relations become classi- 
fiable in terms of the present system. Our in- 
tent is to be able to code all relations occurring 
in attitudinal cognitions into these four broad 
categories. 


Cognitive Units or Sentences 

Cognitive units are built out of pairs of ele- 
ments, connected by a relation. That is, the 
basic "sentences" of attitudinal cognition are 
of the form 


ArB 


where A and B are elements and r is a relation. 
Many sentences which at first seem more com- 
plicated than the simple ArB unit may be re- 
duced to such a unit by broadening the 
definition of an element. For example, consider 
the sentence, 

“Nasser (A) insists on (p) all Suez tolls (B) 
belonging to (p) Egypt (C)." (Here p denotes a 
positive relation.) This sentence, symbolically, 
is Ap(BpC). But regard (BpC) as a new element 
D, the broader conception “all Suez tolls be- 


longing to Egypt." Then we have simply ApD. 
In this way, we reduce our catalogue of basic 
sentences to four: 


ApB 
AnB 
AoB 
AaB 


The Conceptual Arena. The Structure Matrix 


For the convenience of both theory and oper- 
ation, we conceive of an attitude as being de- 
fined over a certain delimited though perhaps 
large conceptual arena. We restrict attention, 
in other words, to those elements which are 
(phenomenally) relevant to the given issue or 
attitude object. 

Operationally, what one can do is this. Fol- 
lowing Zajonc (17), we ask the subject to list all 
the words or short phrases that come to mind 
as he mulls over a given topic, such as “Having 
an Honor System at Yale” or “The State De- 
partment Ban on Reporters Going to Red 
China.” Subjects will typically write several 
phrases in rapid-fire order, then pause, give one 
or two more, and finally stop. These words or 
phrases are not always “noun-like,” but by sim- 
ple instructions to the subject, they can be 
reworded satisfactorily and be treated subse- 
quently as the cognitive elements. This proce- 
dure yields the conceptual arena idiosyncratic 
to each subject.t 

After the arena is defined, the relations be- 
tween pairs of elements may be mapped. We 
conceptualize a matrix setting forth the rela- 
tions between each element and every other. 
This matrix we denote the structure matrix, 
R (see the “Mathematical System"). As an illus- 
tration of a structure matrix, consider that 
shown in Table 2. 

The entries in this matrix were elicited by 
instructing a subject to fill in the middles of 
sentences linking his elements (the six listed). 
He wrote these “open-middled” sentences: 


The honest student feels very reluctant about 
reporting cheaters. 
The honest student possesses the feeling of be- 


ing trusted. 


1 For some experimental purposes, it is more con- 
venient to define a common conceptual arena for 
a group of subjects. Some topics yield low overlap 
between subjects, but other topics possess sufficient 
overlap so that it is worthwhile to construct a 
group arena encompassing the most generally sa- 
lient elements. 
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TABLE 2. Structure Matrix; Topic: Having an 


Honor System at Yale 


Honest 


Report Feel Cheating Honor 


Ego Student Cheater Trust byFew System 

Ego (the subject b p n p n o 
himself) 

The honest stu- [2 p n p o p 
dent 

Reporting n n p o n p 
cheaters 

Feeling trusted p p p o p 

Cheating by n o n o p n 
a few 

An honor system o p p p n p 


The honor system is fine for the honest student. 

The honor system promotes the feeling of being 
trusted. 

Cheating by a few is cut down by reporting 
cheaters, 

The honor system is harmed by cheating by a 
few. 

The honor system involves reporting cheaters. 


The verb forms in these sentences were coded? 
into the relations p and n shown in the matrix. 
In addition, the subject himself was asked 
whether he felt favorable, neutral, or unfavor- 
able toward each element. His responses are 
shown in the first row (column) of the matrix 
in Table 2. 

All relations between elements and them- 
selves are taken to be positive. Also, relations 
are considered to be symmetric; i.e., if ArB, 
then BrA. Our preliminary work does not sug- 
gest a pressing need to revise these assumptions. 

The structure matrix is our basic description 
of attitudinal cognition. One may wish to su- 
perimpose further refinements upon the struc- 
ture such as the intensity of different relations, 
the confidence with which they are held, time 
perspective, and so on. In this paper, we can 
answer only the simplest question, "In what 
way can the structure matrix be used to charac- 
terize the fundamental structural varieties of 


attitudinal cognition, apart from specific con- 
tent?” 


2 In order to circumvent the coding problem, one 
may require the subject to choose his verb forms 
from a precoded list. This version is presently be- 
ing attempted. Full details of the entire procedure 
will be set forth elsewhere. 


Thinking. Psycho-Logic 

Individual cognitive units may originate in 
various ways—through exposure to written in- 
formation, through social pressure, emotional 
need, thinking, etc. Once originated, these units 
may be manipulated in thought. 

We propose a set of rules by which an indi- 
vidual imputes or discovers new symbolic sen- 
tences by combining two old sentences with an 
element in common. We hypothesize that these 
rules apply only when the individual thinks 
about the topic, or “rehearses the arguments” 
(8, p. 129). 


Rule 1, ApB and BpC implies ApC. 

Rule 2. ApB and BnC implies AnC. 

Rule 3. AnB and BnC implies ApC. 

Rule 4. AoB and BrC implies nothing about 
the relation between A and C, irrespective 
of r. 

Rule 5. If ApC and AnC are both implied, or 
if one is held initially and the other implied, 
then AaC. This is the definition of the am- 
bivalent relation. 

Rule 6. AaC and CpD implies AaD. 

Rule 7. AaC and CnD implies AaD. 

Rule 8. AaC and CaD implies AaD. 


Each rule above has several equivalent forms 
not listed, arising from the symmetry assump- 
tion ArB — BrA. 

Though we have given formal status to these 
rules? note that the verbal corollaries are not 


3 The entire set of rules may be organized into one 
with the aid of Tables 1 and 2 of The “Mathemat- 
ical System." Letting R denote the structure ma- 
trix, derived or imputed sentences may be found 
from the matrix multiplication of R by itself, The 
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necessarily logical at all. Exemplify Rule 3 by 
the sentences: 


India (A) opposes (n) U. S. Far Eastern policy 
(B) 

U. S. Far Eastern policy (B) is directed against 
(n) Communism (C). 

Therefore, India (A) is in favor of (p) Com- 
munism (C). 


Such "reasoning" would mortify a logician, yet 
it can be found in much this form inside of 
millions of heads. Thus we speak of the formal 
system as psycho-logic rather than as logic. 
We assume that thinking about a topic must 
be motivated before it will occur. Some possible 
motivating conditions would be these: 


1. Pressure to reach a decision on the topic. 

2. Socially derived needs to appear informed 
on the topic, to converse well about it, to win 
over others, etc. Anticipation of the relevant 
social situations would motivate thinking. 

3. Relevance of the topic to needs, conflicts, 
and persisting preoccupations. Activation of 
such processes would generate pressure to think. 

4. A general "cognitive style" of the indi- 
vidual such that thinking per se is satisfying. 
For such individuals, mere mention of the topic 
might motivate thinking. 


Suppose that a given individual is motivated 
to think about a given topic. He already pos- 
sesses certain relevant cognitive units. Using 
psycho-logic, he imputes new units. These new 
units may or may not be compatible with his 
existing units. Referring back to the India- 
China example, if the individual had originally 
thought, “India (A) resists the influence of (n) 
Communism (C)," then he would be confronted 
with the coexistence of ApC and AnC. By Rule 
5, this is expressed AaC, the ambivalent rela- 
tion, which might translate back into words 
thus: 


The relationship between India (A) and Red 
China (C) is very confusing (a). 


yield imputations which are 


powers R2, R8, etc., had 
from the original sen- 


successively more removed 
tences. ; 
4 The potency of these various motivating condi- 
tions could be subjected to empirical test, using 
before and after measures of cognitive structure and 
experimental manipulation of motivating condi- 
tions in-between. One such experiment has in fact 
been carried out by Zajonc (17), using a different 
theoretical context. See his discussion of “cognitive 


tuning.” 


On the other hand, had he originally held ApC, 
then the imputed sentence ApC would reinforce 
rather than contradict his original view. 


Balance and Imbalance 


Certain structure matrices are characterized 
by the interesting property that no amount of 
thinking (i.e., imputing of new relations accord- 
ing to the rules above) leads the individual into 
any inconsistency (ie, ambivalent relation). 
The cognitive structures represented by such 
matrices are said to be balanced. When the 
structure matrix is such that one or more am- 
bivalent relations will be discovered in think- 
ing, the structure is said to be imbalanced. 
(Our statement of the mathematical condition 
for cognitive balance is given in the “Mathe- 
matical System.") In common-sense terms, a 
balanced cognitive structure represents a “black 
and white” attitude. The individual views some 
elements as good and the other elements as bad. 
All relations among “good elements” are posi- 
tive (or null), all relations among “bad ele- 
ments” are positive (or null), and all relations 
between good and bad elements are negative 
(or null). 


Redressing Imbalance 

Heider (7), Festinger (5), Osgood and Tan- 
nenbaum (11), and others have postulated that 
individuals strive to reduce or redress cognitive 
imbalance, or “dissonance.” This is probably 
especially true for attitudinal cognitions. It is 
important to note, however, that potential im- 
balance will remain undiscovered by an indi- 
vidual unless he is motivated to think about 
the topic and in fact does so. Assuming these 
necessary preconditions, suppose that an indi- 
vidual does come upon a cognitive inconsistency 
in his attitude. What can he do about it? Three 
things: 

1. Change one or more of the relations. 

2. Redefine, “differentiate,” or “isolate” one 
or more elements. 

3. Stop thinking. 

A simple example will illustrate the first two 
methods. The third method is self-explanatory. 
Take the three elements, E, C, G: Ego (a Yale 
student), Having Co-eds at Yale, and Getting 
Good Grades. The subject might tell us: 


"I'm for having co-eds at Yale.” (EpC) 

“I want good grades.” (EpG) 

“Having co-eds at Yale would undoubtedly 
interfere with getting good grades.” (CnG) 
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is imbalanced. Never having been forced to take 
a consistent stand on the issue, however, our 
subject may readily tolerate (or even be un- 
aware of) this imbalance. Now suppose that the 
issue is hotly debated and our subject thinks. 
The imbalance becomes apparent and he seeks 
a balance-producing resolution. 

Method l. He alters any one of the three 
relations, by abandoning the desire for good 
grades, by opposing the admission of co-eds, or 
by rationalizing to the effect that “Co-eds do 
not really interfere with getting good grades (in 
fact, they enhance the chances, etc.)” 

Method 2. (The following is one of various 
possibilities.) He differentiates the concept 
“Getting good grades” into "Getting A's" and 
"Getting C's" and then reasons that while co- 
eds may interfere with getting A's, they don't 
interfere with getting C's, and what he really 
wants is not to get A's but to get C's. 

It is assumed that under sufficient pressure to 
continue thinking, the individual will try Meth- 
ods 1 and 2, presumably seeking a relatively 
effortless means to achieve balance. If these at- 
tempts fail, because certain relations are resis- 
tant to change and certain elements are difficult 
to redefine, the individual may resort to 
Method 8 which is to stop thinking. If, how- 
ever, strong pressures, internal or external, do 
not permit him to stop thinking, he will re-ex- 
amine the topic, seeking a more complex util- 
ization of either or both of the first two meth- 
ods. And so on. With extremely strong pressure 
to continue thinking, some cognitive units will 
in all probability ultimately yield to one attack 
or another. With weak pressure and a structure 
that is highly resistant to change, the individual 
will most likely stop thinking, and his attitu- 
dinal structure will revert to its state before he 
started thinking. 

In other words, we envision an extensive 
hierarchy of cognitive solutions to the problem 
of reducing imbalance. Experimental predic- 
tion of outcome is extremely dificult under 
these circumstances, and represents a consider- 
able challenge. The next section presents a 


5 So-called “conversion experiences” may be repre- 
sentative of the extreme case: very extensive cogni- 
tive changes are characteristic of this phenomenon. 


crude preliminary step toward meeting the 
challenge. 


Experimental Prediction of Cognitive Changes 


Our present machinery is not refined enough 
to permit predictions with respect to Method 2 
above, “redefining the elements.” It would ap- 
pear feasible, however, to restrict experimen- 
tally the availability of this alternative [the 
arguments of Asch (3) to the contrary] and 
focus upon Method 1. 

The question we pose for ourselves is this: 
“Given an imbalanced structure® with highly 
stably defined elements, can we predict which 
particular set of relations in the whole concep- 
tual arena will be changed in order to achieve 
balance?” 

The mathematical system presented later in 
detail is helpful in providing what we hope will 
be an approximate answer. In that system, the 
complexity of imbalance of a structure is de- 
fined as the minimum number of changes of 
relations necessary to achieve balance. The 
system also implies a way of identifying which 
particular changes constitute the minimum set, 
or, when there are alternative minimal sets, 
identifies all of them. In addition, the "next 
to minimal” sets of changes can be identified, 
and so on. 

Working under the highly tentative heuristic 
assumption that all relations are equally resis- 
tant to change, the least effortful balancing 
operation on an imbalanced structure is the 
one that requires the fewest changed relations. 
A predicted set of changes can easily be gen- 
erated under this assumption. 


6 One may provoke imbalance experimentally by 
strong persuasive communication aimed at a given 
cognitive unit. We are presently launching an ex- 
periment of this type. 

7 Cartwright and Harary (4) have proposed a dif- 
ferent index for degree of structural imbalance. 
The present index is simpler to compute, and has 
also the appealing feature that it refers to the dy- 
namic property of change in structure. Harary, in 
a personal communication, has indicated that he 
has recently and independently hit upon an index 
equivalent to the present one. He calls it the “line 
index of balance.” A "line" in his terminology is 
a relation in our terminology, His “line index of 
balance” is the minimal number of lines (relations) 
whose negation yields balance. He points out that 
negation in the sense of changed sign and negation 
in the sense of deletion or elimination are equiv- 
alent insofar as balancing is concerned. Where we 
say “change of relation,” this may be taken to mean 
change from f to either n or o, and change from 
n to either p or o. 
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TABLE 3 


Honest 


3 4 5 6 
Cheat 
Report Feel by Honor 


Ego Student Cheater Trust Few System 


1. Ego p p 

2. The honest p p 
student 

3. Reporting n n 
cheaters 

4, Feeling p p 
trusted 

5. Cheating by n o 
a few 

6. An honor 0 p 
system 


n b n " 
n b o " 
b o n b 
o b o b 
n o p n 
b p 5 b 


sem 00oL——————————————— 


The steps involved in the analysis are: 


1. Write down the structure matrix. 
2. Find the row (column) with the largest 
plurality of n's over f's, exclusive of the p in 
the main diagonal of the matrix. Change all 
n's to p's and p's to n’s, in both the row and 
the corresponding column (leaving the diagonal 
entry p unchanged). Ignore the o entries. 

3. Repeat the previous step on à new row 
(column). 

4. Continue the change operations on se 
lected rows and columns until the number of 
n's in the matrix cannot be further reduced.8 


This irreducible minimum number defines the 
complexity of imbalance. Duplication due to 
symmetry is avoided by counting n's on one 
side of the diagonal only. The entries where 
the n's appear identify the minimal set. If the 
minimum number can be attained in more than 
one way, the sets so identified are alternative 
“solutions.” If the minimum number is zero, 
then the structure is balanced. The entire pro- 
cedure is based upon Theorem 13 of the “Math- 
ematical System.” 

The method will now be illustrated with the 
Yale student's structure on "Having an honor 
system at Yale," presented in an earlier section. 

His structure matrix is shown in Table 3. 


8 A certain amount of trial-and-error may be 


required before reaching a final decision. This deci- 


sion is not automatic; the change operation is some- 


times useful on a row an h 
of p's over n's—a temporary increase in the num- 
ber of n's can lead to an ultimate further reduction 


in the minimum number of n's. 


Row 5 contains three n's and one p. We then 
carry out the change operation on Row 5 and 
Column 5, carrying p into n (except in the di- 
agonal) and n into p. The result is shown in 
Table 4. 


TABLE 4 


Both remaining n's lie in Row 3. Three p's 
also lie in this row, but one of them is the 
unchanging diagonal element, so that essen- 
tially it is two n's versus two p's. The change 
operation on Row 3 and Column 3 would yield 
the matrix shown in Table 5. 


TABLE 5 


N 
eo 


1 
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The number of n's here is also two. It is evi- 
dent that the number of n's cannot be further 
reduced by the change operation. It is also evi- 
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dent upon further inspection that no other set 
of change operations will yield as few as two 
n's. Thus the structure matrix is of complexity 
two and there are two alternate minimal solu- 
tions to the balancing problem. 

The first solution tells us that had the sub- 
ject's initial relations between elements 3 and 
l, and 3 and 2 not been as they were, there 
would not have been any imbalance. In other 
words, achieving balance requires changing the 
relations between elements 3 and 1, and ele- 
ments 3 and 2. Looking back at his original 
structure matrix, this means that were he to 
favor reporting cheaters, or at least not be op- 
posed, and also regard honest students as favor- 
ing or not opposing reporting cheaters, his 
structure would be balanced. The "good ele- 
ments" would be Ego, The honest student, Re- 
porting cheaters, Feeling trusted, and The 
honor system, while Cheating by a few would 
be a “bad element.” In short, “We honest stu- 
dents would all feel trusted in an honor system. 
- A few cheaters might violate the system, but 
they should be reported.” 

The second solution tells us that the subject 
must change the relations between elements $ 
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In other words, were he to view Reporting 
cheaters as not identified with the honor system, 


and as in any case ineffective in cutting down: 


Cheating by a few his structure would be: bal- 
anced. Ego, The honest student, Feeling trusted, 
and The honor system would be "good ele- 
ments" and Reporting cheaters and Cheating 
by à few, "bad elements." A hypothetical quote 
would read, “We honest students would all feel 
trusted in an honor system. A few cheaters 
might violate the system, but I wouldn't feel 
right about reporting them and it probably 
wouldn't work anyway. There are a few lousy 
guys who will always try to foul things up." 

We make no claim that this subject would 
necessarily make one or the other of these 
changes in striving for a balanced structure. 
We only claim that if the strengths of all rela- 
tions are equal, the above changes would be 
preferred for their simplicity. 

We are now working on various means for 
empirical measurement of the strength of rela- 
tions. Incorporation of the strength variable in 
the model will represent an important refine- 
ment. 


l. Abelson, R. P. A technique and a model for multi-dimensional attitude scaling. Pub. opin. 


Quart., 1954, 18, 405-418. 


2, Asch, S. E. Social psychology. New York: Prentice-Hall, 1952. 

3. Asch, S. E., Block, H., and Hertzman, M. Studies in the principles of judgments and atti- 
tudes: I. Two basic principles of judgment. J. Psychol., 1988, 5, 219-251, 

4. Cartwright, D., and Harary, F. Structural balance: a generalization of Heider's theory. 


Psychol. Rev., 1956, 68, 277-993. 


5. Festinger, L. Theory of cognitive dissonance. Evanston: Row Peterson, 1957. 
6. Green, B. F. Attitude measurement. In G. Lindzey (Ed) Handbook of social psychology. 


Cambridge: Addison-Wesley, 1954. 


7. Heider, F. Attitudes and cognitive organization. J. Psychol., 1946, 21, 107-112. 
8. Hovland, C. I., Janis, I. L., and Kelley, H. H. Communication and persuasion. New Haven: 


Yale Univ. Press, 1958. 


9. Krech, D., and Crutchfield, R. Theory and problems of social psychology. New York: 


McGraw-Hill, 1948, 


10. Lewin, K. Field theory in social science. New York: Harper, 1951. 
ll. Osgood, C. E., and Tannenbaum, P. H. The principyleof congruity in the prediction of 


attitude change. Psychol. Rev, 1955, 62, 42-55. 


12. Peak, H. Attitude and motivation. In M. Jones (Ed.) Nebraska symposium on motivation, 


1955, Lincoln: Univ. of Nebraska Press, 1955. 


13. Rokeach, M. The nature and meaning of dogmatism. Prychol. Rev., 1954, 61, 194-204. 
M. Rosenberg, M. Cognitive structure and attitudinal affect. J. abnorm. soc. Prychol., 1956, 


55, 367-372. 


15. Smith, M. B., Bruner, J. S, and White, R. W. Opinions and personality. New York: Wiley, 


1956. 


16. Tolman, E. C. Purposive behavior in animali and men. New York: Century, 1982. 
17. Zajonc, R. B. Structure of cognitive field. Unpublished doctoral dissertation, Univ. of Mich- 


igan, 1954. 


pe 


38. Modes of Resolution of Belief Dilemmas 


ROBERT P. ABELSON 


INTRODUCTION 


This is a paper about intrapersonal conflict 
resolution. We first identify the kind of conflict 
to be considered. 

There are two levels of analysis of intraper- 
sonal conflict: the action level and the belief 
level, the former dealing with external motor 
responses and the latter with internal affective 
and cognitive processes. Particular instances of 
conflict may, for theoretical convenience, be 
localized at one or another of these levels. For 
example, one may ask how a person acts when 
simultaneously motivated to approach and to 
avoid an external object (3, 9, 10). Or one may 
ask instead what happens to the cognitive rep- 
resentation of an external object when the 
object simultaneously incurs favorable and un- 
favorable cognitions (12). The present paper is 
addressed to the latter type of question. We 
shall not consider the problem of whether and 
how the action level is to be reduced to the 
belief level or vice versa. We only consider 
conflicts between one belief and another or, 
more generally, conflicts within a belief struc- 
ture, The term "belief dilemma" is intended 
to enforce the distinction between the variety 
of conflict here considered and conflict in gen- 
eral. 


BELIEF DILEMMAS 


The model of cognitive structure to be de- 
scribed is similar at various points to other re- 
cent models (6, 8, 13). 

First, we imagine a cognitive 
a "cognitive element," corresponding to any 
attitude object. Associated with such a cognitive 
element is a numerical value, positive if the 


representation, 


* Reprinted from Journal of Conflict Resolution, 
1959, 3, 343-352, with the permission of the author 
and the publisher. 


object is liked, negative if the object is disliked. 
Next, we suppose that between each pair of 
cognitive elements there may exist some kind 
of perceived relation. Assigned to each relation 
is another numerical value, positive if the rela- 
tion is “associative,” negative if the relation is 
“dissociative” (11). Examples of associative rela- 
tions are: is, has, includes, likes, helps, produces, 
implies. Examples of dissociative relations are: 
avoids, hates, hinders, defeats, destroys, is in- 
compatible with. A zero value indicates a null, 
or irrelevant, relation. 

Given an attitude issue or "conceptual arena” 
(1), a certain set of cognitive elements would 
be relevant for a given individual. The set of 
relevant elements and the particular relations 
among them define the content of the indi- 
vidual's belief system on the issue. The form, 
or structure, of belief may be expressed inde- 
pendently of the content according to the array 
of numerical affect values and relation values 
defined above. 

A belief structure may or may not contain 
inconsistencies. By inconsistency is meant not 
logical inconsistency but psychological incon- 
sistency, or, as it has been variously referred to, 
imbalance (7), incongruity (12, 13), or disso- 
nance (5). We shall use the term “imbalance.” 

Heider (7), Festinger (5), and Osgood and 
Tannenbaum (12) have all postulated a motiva- 
tion for the reduction of imbalance. There is 
said to be a tendency, a pressure, toward the 
attainment of cognitive balance. An essential 
qualification to this postulate has been pointed 
out by Abelson and Rosenberg (1). There are 
innumerable inconsistencies in anyone's belief 
system which may lie dormant and unthought 
about. Pressure toward cognitive balance, if 
always operative on all cognitive elements, 
would produce much more balance in belief 
systems than one finds empirically. It is much 
more plausible to assume that this pressure op- 
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erates only when the issue is salient; that is, 
when the issue is being “thought about,” or, if 
this is too rational a terminology, when “cog- 
nitive work” is applied on the issue. General 
methods for identifying the presence of imbal- 
ance in a structure of any size have been given 
elsewhere (1, 4). Here we confine our analysis 
to a simple case of imbalance: two elements and 
the relation between them. 

There are six possible cases to be considered: 
two positively valued objects, related associa- 
tively or related dissociatively; one positively 
valued object and one negatively valued object, 
related dissociatively or related associatively; 
and two negatively valued objects, related asso- 
ciatively or related dissociatively. In each of 
these three pairs of cases, the first possibility is 
balanced, the second one is imbalanced. This 
may be clarified by reference to Figure 1. 


Balanced Imbalanced 


EU UE 
Figure 1. Cognitive structures with two elements 
and one relation. An unbroken line symbolizes an 
associative (positive) relation; a broken line a dis- 
sociative (negative) relation. The mechanism of 
denial aims toward the conversion of a structure 
on the right into one on the left, either through 
change of affect toward the element (“denial of the 
element”) or change in the sign of the relation 
(“denial of the relation"). 


An imbalanced dyad will be said to constitute 
a belief dilemma when the intensity of affect 
toward the objects is strong and when the dyad 
is often salient (i.e, often present in thought). 


Original structure 


MODES OF RESOLUTION 


Four possible modes of resolution are speci- 
fied below. Each can manifest itself in several 
ways. The modes are labeled: (a) denial, (b) 
bolstering, (c) differentiation, and (d) transcen- 
dence. 

Denial refers to a direct attack upon one or 
both of the cognitive elements or the relation 
between them. The value felt toward the object, 
whether positive or negative, is denied, or the 
opposite is asserted; or the sign of the relation 
between the elements is explained away, or the 
opposite is asserted. Examples are: the man on 
a diet professing that he never liked rich foods 
anyway, the groom convincing himself of his 
avid belief in his bride's religion, John Calvin 
interpreting the scriptures to show that Christ 
never really condemned usury. If an attempt 
at denial is successful, it will convert an im- 
balanced structure into a balanced one. How- 
ever, denial attempts may run into various 
difficulties, as, for example, when the denial is 
too great a distortion of reality or conflicts with 
other elements in the larger belief system. For 
example, the Boston colonists faced in 1773 
with the odious taxation on tea went so far as 
to vote that “it is the sense of this Body that 
the use of tea is improper and pernicious.” It 
is unlikely that this denial of the desirability 
of tea, albeit effective in encouraging group 
action, was effective in suppressing the taste for 
tea of the individuals concerned. 

The mechanism called “bolstering” consists 
of relating one or the other of the two cogni- 
tive objects in a balanced way to other valued 
objects (Figure 2), thereby minimizing the rela- 
tive imbalance in the structure, This mechanism 
plays an important part in Festinger's theory of 
cognitive dissonance (5). He points out many 
situations in which the introduction. of new 
cognitive elements is useful in reducing disso- 
nance. This is a mechanism not for eliminating 


Bolstered structure 


eG 


Figure 2. The mechanism of bolstering in reducing cognitive imbalance. In the 
bolstered structure (right) the units AC, AD, .. . are all balanced. The relative 
effect of the imbalanced unit AB is thus reduced. 
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balance entirely but only for drowning it out, 
so to speak. Examples are: the smoker who is 
worried about lung cancer telling himself that 
smoking is extremely enjoyable, good for his 
nerves, and socially necessary; and the propo- 
nent of a large standing army, unwelcome in 
peacetime, claiming that it is good character 
training for the nation’s youth. The mechanism 
of bolstering may be used in conjunction with 
the mechanism of denial. For example, in the 
example of the large standing army the advo- 
cate might also say that the large standing army 
was not contrary to peaceful purposes; in fact, 
it aided the cause of peace to have a large 
standing army. 

The two mechanisms listed thus far have the 
property that they preserve the identity of the 
cognitive elements. The meaning of the attitude 
objects remains the same even though attitude 
toward the objects may be weakened by denial 
or strengthened by bolstering. Another mode 
of resolution arises if we consider the possibility 
of differentiation of the cognitive elements. An 
element may be split into two parts with a 
strong dissociative relation between the parts. 


Original structure 
(imbalanced) 


| Prin 


(Bible) (Facts of evolution) 


To see how this mechanism might restore cog- 
nitive balance, consider the issue of hydrogen- 
bomb testing. For many people, continued 
hydrogen-bomb testing is positively valued, but 
poisoning of the atmosphere is negatively val- 
ued. These two cognitive objects are associa- 
tively related—there is a causal connection of 
some degree. This dyad is therefore imbalanced. 
But there is bomb testing and there is bomb 
testing: one might differentiate this attitude 
object into two—testing "dirty bombs" and test- 
ing "clean bombs." It is only the testing of 
dirty bombs that contributes to poisoning of 
the atmosphere; the testing of clean bombs pre- 
sumably does not. Thus the imbalance is re- 
solved. To take another example, the facts of 
evolution, positively valued, are contradictory 
to the Bible which is also positively valued. But 
there are two Bibles: the Bible as literally in- 
terpreted and the Bible as figuratively inter- 
preted. The Bible as figuratively interpreted is 
not contradictory to the facts of evolution but 
may be seen as concordant with them. In a 
third example, from an experiment by Asch (2), 
subjects who feel unfavorable toward “politi- 
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Figure 3. The mechanism of differentiation in restoring cognitive balance. 
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cians” are confronted with a highly prestigeful 
source who glorifies the political profession. 
Many subjects get off the hook by differentiat- 
ing statesmen (good politicians) from ward- 
heelers (bad politicians). In these examples one 
element is differentiated into two parts, a new 
part and an old part. The old part retains the 
relation with the other element in the structure, 
but the affect toward it is changed. The new 
part, on the other hand, retains the old affect to- 
ward the differentiated element, but the sign of 
the relation with the other element is changed. 
These changes are reviewed in Figure 3. 

It is interesting to note the large number of 
dimensions along which objects can be differ- 
entiated. They may be differentiated according 
to the internal content of the object, the object 
as viewed in a social context versus a personal 
context, the object as it is versus as it should 
be, the object as it is versus the object as it will 
be, etc. 

The mechanism of transcendence is in a sense 
obverse to the mechanism of differentiation. 
Elements, instead of being split down, are built 
up and combined into larger units organized 
on a superordinate level, as indicated in Figure 
4. For example, the dilemma pitting science 
against religion is transcended by the consider- 
ation that both the rational man and the spir- 
itual man must be jointly cultivated to reach 
a fuller life or a better society or a deeper 
understanding of the universe. Thus the di- 
lemma is transcended by imbedding the con- 
flicting parts in a new concept instrumental to 


Original structure 


A 
(Rational man) (Spiritual man) 


A B 
(The house (The house 
of Montague) of Capulet) 


some higher purpose. The theosophical di- 
lemma of God's presumed permissiveness toward 
evil is sometimes resolved by appeal to tran- 
scendent concepts. In the intriguing case study 
(6) of a group of individuals who prepared for 
a cataclysm that never occurred, it is reported 
that the group leader offered a transcendent 
résolution for the belief crisis: the cataclysm 
was said to have been stayed by God because 
of the group's devotion. 


CHOICE AMONG THE MODES OF 
RESOLUTION 


Presumably, the individual with an imbal- 
anced cognition will strive to choose among 
the various modes of resolution. Imbalanced 
structure would then be under a variety of 
pressures to change. The theoretical specifica- 
tion of which particular changes are likely to 
take place is a complex problem. Several work- 
ing propositions are sketched here. A more 
rigorous theory is in the process of development. 

Proposition 1. 'There will be a hierarchy of 
resolution attempts in general proceeding in 
the following order: denial, bolstering, denial, 
differentiation, and transcendence. 

The hierarchy of resolution attempts is based 
upon the relative ease of achieving success with 
each of the methods. The reason denial appears 
twice in the listing is that there are usually two 
points in the process at which denial may enter. 
If we consider the situation in which imbalance 
is introduced by forced or accidental exposure 
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Figure 4. The mechanism of transcendence in restoring cognitive balance. 
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to propaganda or opinions seeking to establish 
new cognitive relations or to contradict pre- 
viously held affect values, a first opportunity 
for denial may arise by a rejection of the rele- 
vance of the new material. If the initial denial 
fails, bolstering will be attempted and then an- 
other attempt at denial, this time buttressed by 
further thought about the issue. The presump- 
tion here is that denial and bolstering are sim- 
pler cognitive mechanisms than differentiation 
and transcendence, although they are not neces- 
sarily more effective in reducing cognitive im- 
balance. Differentiation is difficult because it 
requires intellectual ability, flexibility, and 
because, when there is strong affect toward a cog- 
nitive object, it is not easily split apart. Tran- 
scendence is presumably still more difficult, for 
it requires the existence of a compelling super- 
ordinate structure in which a given imbalance 
may be imbedded. 

Proposition 2. When two cognitive elements 
stand in imbalanced relation to each other and 
the affect toward one is more intense than to- 
ward the other, the tendency will be to apply 
bolstering toward the more intensely affected 
element and/or denial toward (a) the less in- 
tensely affected element and/or (b) the relation 
between the elements. 

Proposition 2a. The probability that an at- 
tempt will be made to bolster an element is 
high if other elements relevant to it are strong 
and stand in balanced relation to it (Figure 2) 
and is low if other relevant elements are weak 
or stand in imbalanced relation to it. s 

Corollary. Elements for which the indi- 
vidual's affect is intensely socially supported are 
readily subject to bolstering attempts when 
caught in a strong dilemma. 

Proposition 2b. The probability that an Rr 
tempt will be made to deny an element is high 
if other relevant elements are strong and stand 


Original delemma 


B 
(Smoking, heavy 


A 
(Deteriorating 
eating, drinking) 


health) 


Example 1. Note. The element C must. be compelling 


in imbalanced relation to it and is low if other 
relevant elements are weak or stand in balanced 
relation to it. 

Corollary. Elements with which considerable 
shame or guilt is associated (e.g. elements con- 
noting the overindulgence of appetites) are 
readily subject to denial attempts when caught 
in a strong dilemma (i.e, when firmly related 
in imbalanced fashion to a strongly affected 
element). See Example 1. 

Proposition 2c. Relations between cognitive 
elements are readily denied when the external 
evidence for the relation is remote, ambiguous, 
under suspicion of bias, or dependent upon 
specific circumstances which can readily be per- 
ceived as inapplicable in general. 

Proposition 2d. A relation between cognitive 
elements A and B is readily subject to denial 
attempts when there is available an element 4’, 
formally similar to A and standing in associa- 
tive relation with it, such that the relation be- 
tween A’ and B is of opposite sign as between 
A and B, and is stronger. 

i 2e. A relation between cognitive 
elements A and B is readily subject to denial 
attempts when there is available an element A’, 
formally similar to A, yet standing psycholog- 
ically in dissociative relation to 4, such that the 
relation between A’ and B is of the same sign 
as between A and B, but stronger (the “mote- 
beam” technique). 

See Example 2 (from the point of view ofa 
liberal but very proud southerner). 

Proposition 3. If the affects toward two cog- 
nitive elements which stand in imbalanced re- 
lation to each other are nearly equal and the 
resolutions suggested in Proposition 2 fail, the 
converse resolutions will be attempted; that is, 
there will be attempts to bolster the less intense 
element and/or to deny the more intense ele- 
ment. 
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stable. If original positive affect toward B is strong, the stability of the attempted 


resolution is jeopardized. 
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Proposition 4. The classical relationship be- 
tween the intensity and extremity of attitude 
may be explained in terms of a succession of 
dilemma resolutions in individual histories with 
the attitude object. 

Explanation. The mechanism of bolstering 
used in the service of dilemma resolution in- 
creases the intensity of affect toward the object. 
An object which has been repeatedly bolstered 
is therefore the object of an intense attitude. 
But repeated bolstering also increases the ex- 
tremity of attitude. In bolstering, the attitude 
object is connected with other objects. New 
reasons and supports are given for it; it is seen 
to be instrumental to other values; it is seen to 
be supported by various people and groups. In 


short, it is imbedded in a cognitive system of 
ever widening circumference. If scope of cogni- 
tive support is equated with extremity of atti- 
tude, then the relationship between extremity 
and intensity follows. 

Those individuals who do not invoke bolster- 
ing will in general be the ones with moderate 
attitudes of low intensity. 

Proposition 5. If, in the search for new ele- 
ments to bolster an original element in imbal- 
ance, further imbalance is created (usually 
because the new elements are imbalanced with 
each other), differentiation of the original ele- 
ment is encouraged. 

See Example 3 (it is pointed out to an indi- 
vidual that universal public school education, 
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Example 4 


by being compulsory, violates the democratic 
ideal of individual free choice—some parents 
might want to keep children out of school). 

Proposition 6. When an element is differen- 
tiated it is crucial to the maintenance of the 
resolution that the old part and the new part 
of the element be strongly dissociated. 

Proposition 7. Transcendent resolutions are 
likely to be invoked only in the case of chron- 
ically insoluble dilemmas. However, once a tran- 
scendent resolution is achieved, it may be found 
applicable to a variety of dilemmas. 

Proposition 8. Mass propaganda efforts seek 
dilemma resolutions effective for a large num- 
ber of people simultaneously. 

See Example 4 (a nation in a period of hard 
times). 

Revolutionary propaganda aims to bolster 4 
and differentiate B along a time dimension. 
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39. Cognitive Consistency and. Attitude Change 


WILLIAM J. McGUIRE 


There has been a recent resurgence of interest 
among psychological researchers in the predic- 
tive value of a postulate that people tend to 
maintain logical consistency among their cog- 
nitions (and even between cognitions and more 
gross behavior). The most influential single 
source of this revived interest is probably Hei- 
der's (1946, 1958) "balance" theory, while Fest- 
inger's (1957) theory of cognitive dissonance is 
probably the most stimulating elaboration to 
date. Most of the approaches that employ this 
postulate recognize, implicitly or explicitly, that 
the tendency towards consistency is not abso- 
lute. Its expression is limited by other, poten- 
tially conflicting forces in the person and by the 
inadequacies of his cognitive apparatus, even 
in the absence of conflicting forces, to achieve 
full consistency. These coordinate hypotheses of 
cognitive consistency and its predictable limita- 
tions have been applied successfully to the pre- 
diction of behavior in many areas. The work 
reported here involves two applications in the 
area of persuasion. 

The first of these applications involves what 
will be called the “Socratic method” of persua- 
sion. The persuasion technique usually studied 
in the social sciences involves inducing a change 
in a person's opinion by presenting him with a 
persuasive message from an external source, 
either through a "mass" one-way communica- 
tion channel or in a face-to-face reciprocal com- 
munication network. The postulate of cognitive 
consistency suggests that persuasion could be 
effected by the quite different technique of elic- 
iting the persuasive material from the person's 
own cognitive repertory, rather than present- 
ing it from outside. Specifically, this technique 
would involve asking the person his opinions on 
logically related issues, thus sensitizing him to 


* Reprinted from Journal of Abnormal and Social 
Psychology, 1960, 60, 345-853, with permission of 
uoc" and the American Psychological Associ- 
ation. 


any inconsistencies that exist among his stands 
on these issues and producing a change towards 
greater mutual consistency. The technique has 
an obvious affinity to the “Socratic method" 
used in the Platonic dialogues (Jowett, 1937). 

The derivation of the foregoing hypothesis 
requires, besides the postulate of a tendency to 
maintain cognitive consistency, the assumptions 
that various distorting factors (of which “wish- 
ful thinking" is given primary attention in this 
study) have introduced inconsistency among 
the initially elicited set of opinions, and that 
the temporally contiguous elicitation of these 
opinions brings the tendency towards consis- 
tency more strongly to bear upon them. 

Part of the study described below was de- 
signed to test the hypothesis that a person tends 
to change his opinions on logically related issues 
in the direction of greater mutual consistency 
when he is merely asked to state these opinions 
in close temporal contiguity. Several additional 
hypotheses regarding the focus of the Socratic 
effect, that is, the type of opinion most likely 
to be adjusted in case of inconsistency, were 
also tested. 

The second application of the cognitive con- 
sistency postulate to the area of persuasion in 
the present experiment dealt with the logical 
repercussions of persuasive communications. 
The effectiveness of a persuasive message is 
usually gauged by the amount of change in 
opinion that it induces on the issue with which 
it explicitly deals. From the postulated consis- 
tency tendency, however, it would follow that 
an induced change on the explicit issue tends 
also to produce changes on logically related 
(unmentioned) issues such as would tend to 
maintain internal consistency among beliefs. 

From the additional, limiting assumption 
that there is a certain amount of inertia in the 
person's cognitive apparatus, two additional 
hypotheses were derived. It was hypothesized 
that the consistency-maintaining change ex- 
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pected on remote issues is less than the amount 
logically required for complete consistency with 
the induced changes on the explicit issues. Fur- 
ther, it was hypothesized that inertia would 
also result in temporal effects, such that the 
impact of the message on the remote issue occurs 
only gradually, the opinion on the remote issue 
continuing to change in the logically required 
direction for some time after receipt of the per- 
suasive message. 

Since most of these hypotheses involve pre- 
dictions of fine graduations of change towards 
or away from a reference point of precise cog- 
nitive consistency, we need a quantitative defi- 
nition of consistency. Such a definition is 
provided by a model combining formal logic 
and probability theory, the relevant aspects of 
which are summarized below. 

Consider three issues so interrelated that a 
person taking Stands a and b on two of these 
issues would, if he were to be consistent, be 
logically required to take Stand c on the third. 
That is, the three stands are in a syllogistic re- 
lationship, with c following as a valid conclu- 
sion from the conjunction of a and b. If the 
person’s opinions on these issues are obtained 
by having him indicate his adherence to each of 
the Stands, a, b, and c, on a probabilistic scale, 
we can specify that to be completely consistent, 
these probabilistically scaled opinions must be 
interrelated in the form 


ple) = b(a& b) + p(k) —p(a& b) (1) 


where p(k) in his opinion of the probability of 
c on bases other than the conjunction of a and 
b. It can also be specified that if we induce a 
change of opinion on a and/or b (for example, 
by a persuasive communication) while p(k) does 
not change appreciably, then the required 
change on c necessary for maintaining logical 
consistency among the opinions is 


Ap(c) = Ap(a & b) (2) 


which, assuming that a and b are independent 
events, i.e, that p(a & b) — (a) p(b) becomes 


Ap(c) = Ap(a)p(b) + Ap(byp(a) + Ap(a)Ap(b) 
(3) 


where p(a) and p(b) refer to the initial opinions, 
before the communication induced changes. 
When, in the discussion below, we refer to the 
“logically required change on the conclusion,” 
we are referring to the Ap(c) value specified in 
Equation 3. 


METHOD 


Material. The Ss’ opinions on sets of logically 
related issues were measured by means of a 
questionnaire that contained 48 statements. 
These statements comprised 16 sets of three syl- 
logistically related propositions; that is, two of 
the propositions stated stands on a pair of issues, 
agreement with which would logically imply 
agreement with the stand taken in the propo- 
sition on the third issue.1 The three proposi- 
tions making up any one syllogism appeared in 
different parts of the questionnaire, being inter- 
spersed with the propositions of other syllogisms. 

The Ss were asked to indicate in probabilistic 
terms the extent of their adherence to the stand 
taken in each proposition. For this purpose each 
proposition was followed by a graphic scale con- 
sisting of a five-inch horizontal line marked off 
at half-inch intervals and calibrated 0, 10, 20 . . . 
90, 100. To the left of the scale was written 
“Very improbable end," and at the right, “Very 
probable end." The Ss were asked to decide 
how probable they felt the truth of each state- 
ment to be, and to indicate this subjective prob- 
ability by drawing a line through the scale at 
the appropriate point. 

In a later section of the questionnaire the 
propositions were repeated, this time with a 
scale consisting of five boxes labelled from left 
to right "very desirable," “somewhat desirable,” 

, "very undesirable." The $ was instructed 
to indicate his feeling regarding the desirability 
of each proposition by putting an X in the 
appropriate box. (As explained below, these 
ratings of desirability were obtained to allow 
identification of initial inconsistencies, due to 
wishful thinking, among the probability ratings.) 

The questionnaires were administered during 
each of the three experimental sessions. Be- 


1 A complete list of all syllogisms and persuasive 
messages used in the present study has been de- 
posited with the American Documentation Insti- 
tute. Order Document No. 6235, from ADI 
Auxiliary Publications Project, Photoduplication 
Service, Library of Congress, Washington 25, D. C.. 
remitting in advance $1.25 for microfilm or $1.25 
for photocopies. Make checks payable to Chief, 
Photoduplication Service, Library of Congress. 
One of the 16 syllogisms used was as follows. 
Students who violate any regulation that has been 
made to safeguard the lives and property of other 
students will be expelled; the regulations against 
smoking in the classrooms and corridors were made 
to safeguard the lives and property of the student; 
students who violate the regulation against smok- 
ing in the classrooms and corridors will be expelled. 
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tween the first and second administrations of 
the opinion questionnaires, a series of persua- 
sive messages were communicated to the Ss in 
mimeographed booklets.2 There were 16 such 
messages, each directed at one premise of each 
of the 16 syllogisms. Each message argued that 
the proposition in question was true, i.e., was 
aimed at increasing the S’s probabilistic esti- 
mate of the statement's truth. These messages 
were “one-sided,” 200-300 words long, employed 
rational appeals, and based arguments on fac- 
tual evidence. They were mild in tone and 
attributed to a purportedly competent and 
dependable source. 

Procedure. There were three 50-min. experi- 
mental sessions. In the first, the questionnaires 
were administered to obtain “before communi- 
cation” measures of opinions on the issues. The 
second session, which came one week after the 
first, was devoted to communicating the persua- 
sive messages and then to a second administra- 
tion of the opinion questionnaires in order to 
measure the immediate impact of the persua- 
sive messages. One week later, the Ss partici- 
pated in the third session during which the 
questionnaires were again administered (in or- 
der to ascertain the delayed effects of the per- 
suasive messages). At this final session the Ss 
were also told the purpose of the experiment 
and the E pointed out the deceits that he had 
employed in the experiments and his reasons 
for so doing. 

Until this final “catharsis,” an attempt was 
made to conceal, or at least distract attention 
from, the persuasion aspect of the experiment 
and from the investigator's interest in cognitive 
consistency. The study was purported to be an 
investigation of people's ability to understand 
experts’ discussions of controversial topics. The 
persuasive messages were alleged to be excerpts 
from such discussions, and after each message 
the Ss were required to answer a series of com- 
prehension questions (to emphasize the conten- 
tion that we were primarily interested in 
studying the Ss’ understanding of the material). 
The opinion questionnaires were explained as 
required to ascertain whether the issues dealt 
with were actually controversial in the group 
being studied and whether they continued to 
be controversial throughout the period of the 
experiment. 

Subjects and design. A total of 120 Ss par- 


2 The text of all 16 persuasive can be 
obtained in the ADI Document No. 6235, cited in 
Footnote 1. 


ticipated in the experiment. One-fourth of these 
were high school seniors and the remainder, 
college freshmen; the ratio of men to women 
was about 2:1 at both grade levels. To minimize 
the likelihood that the consistency hypotheses 
would receive specious confirmation by the em- 
ployment of atypically rationalistic Ss, these Ss 
were selected from a group with a rather low 
level of intellectual attainment. Their modal 
high school grade average was below the 30th 
percentile for all high school students in their 
state. Each of these students reported that he 
had never had a course in formal logic or in 
probability theory. 

In order that the 5s might have time to re- 
spond thoughtfully to the opinion items and 
to read the persuasive messages carefully, each 
$ was required to provide data on only 8 of 
the 16 syllogistically related sets of propositions. 
On only four of these eight did he receive 
persuasive messages, the other four serving as 
no-communication "control" items. These no- 
communication syllogisms provided the data for 
testing the Socratic effect hypotheses. The syl- 
logisms on which messages were communicated 
provided the data for testing the hypotheses 
concerning logical repercussions. Specific syllo- 
gisms were alternated between message and no- 
message conditions from S to S. 


RESULTS AND DISCUSSION 


Hypotheses Concerning the Socratic Effect 
Initial inconsistencies among the opinions. 
In order to identify initial opinion inconsisten- 
cies we employed an auxiliary hypothesis that 
the subjective probabilities of the propositions 
would be distorted in the direction of wishful 
thinking. Internal evidence indicates that this 
assumption was a valid one. The 16 syllogisms 
were partitioned into two subsets: the eight 
in which the desirability of the conclusions was 
low relative to that of the premises; and the 
eight whose conclusions had relatively high de- 
sirabilities. Wishful thinking would tend to 
make the subjective probabilities of the conclu- 
sions low relative to those of their (more de- 
sirable) premises in the first subset of syllogisms, 
while the subjective probabilities of the con- 
clusions in the second set should be high rela- 
tive to their less desirable premises. This 
prediction is borne out by the subjective prob- 
ability ratings obtained in the first session, as 
shown in Table 1. (To avoid confusing them 
with the significance levels, the obtained sub- 
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TABLE 1. Subject Probability Scores (Multiplied by a Factor of 100) in the 


No-Communication Conditions 


Session 


Major Minor 
Premises Premises Premises sions 


Joint of Conclu- over 
Premises 


Syllogisms with con- 

clusions less desir- Ist 

able than premises Changes, Ist to 2nd 
Syllogisms with con- 

clusions more desir- Ist 

able than premises Changes, Ist to 2nd 


62.15 54.14 33.66 35.86 2.20 
—3.04 1.45 


—0.79 547 5.96 


55.50 55.41 30.12 47.30 17.18 


5.96 — —1.98 2.06 0.35 —1.72 


jective probability ratings given in this report 
are the obtained probability ratings, multiplied 
by 100.) The mean subjective probability of the 
conclusions exceeded the mean joint probability 
of the premises by only 2.20 points on the 100- 
point scale in the first eight syllogisms, while in 
the other eight syllogisms (the conclusions of 
which had relatively high desirability) the mean 
subjective probability of the conclusions ex- 
ceeded the mean joint probability of the prem- 
ises by 17.18 points. This difference between 
2.20 and 17.18 points is significant beyond the 
01 level. (All significance levels reported in this 
paper are based on analyses of variance and 
two-tailed tests.) Hence, the attempted manipu- 
lation of initial inconsistency by use of the ten- 
dency toward wishful thinking was successful. 

The main effect. The changes in opinion 
obtained in the conditions in which no persua- 
sive message was communicated on the given 
issues provide the test of the main Socratic hy- 
pothesis, that when the person's opinions on 
related issues are elicited in close temporal con- 
tiguity, his need for consistency is brought 
more fully to bear on them. Hence, when these 
opinions are again elicited on a later occasion, 
they will be found to have changed towards 
greater mutual consistency. In view of the ini- 
tial distortions from consistency produced by 
wishful thinking, it would follow that in the 
no-communication conditions in the present ex- 
periment initial questioning should produce an 
increase in the subjective probabilities of the 
conclusions relative to those of the premises in 
the first eight syllogisms (those with the rela- 
tively undesirable conclusions); while for the 
second eight syllogisms, there would be a rela- 
tive decline in the subjective probabilities of 
the conclusions. 

The results bear out this prediction. As can 


be seen in Table 1, the gain in probability of 
the conclusions over that of the premises in the 
first eight syllogisms averaged 5.96 points. The 
probabilities of the conclusions of the second 
eight syllogisms, on the other hand, showed a 
loss of 1.72 points relative to the premises. 'This 
net difference of 7.68 points between means is 
significant at the .02 level. Hence, simply elicit- 
ing opinions on logically related issues does 
tend to move those opinions toward greater 
mutual consistency when subsequent elicitation 
occurs one week later. 

The difference obtained between the two sets 
of syllogisms as regards internal changes is in 
the direction that would be expected on the basis 
of a simple regression from the initially ob- 
tained difference. In order to test whether there 
is a Socratic effect over and above simple re- 
gression, an additional covariance analysis was 
carried out. The obtained difference in change 
of probability scores between the two sets of 
syllogisms was adjusted for the difference in 
initial levels by means of the obtained within- 
set correlation between the initial probability 
scores and the changes therein. Even when the 
obtained changes are adjusted for initial levels 
by this analysis of covariance, the difference in 
changes remains significant at the .02 level, 
indicative that a significant Socratic effect re- 
mains, even when the effect of regression is 
partialed out. 

The Socratic effect tends to be produced pri- 
marily by the first contiguous elicitation of the 
opinions, with only slight evidence of an accu- 
mulative effect from subsequent, additional 
elicitations. As discussed above, there was a net 
Socratic effect, from the first to second elicita- 
tion of the opinions, of 7.68 points. The cor- 
responding net effect, from second to third 
elicitations was again in the predicted direction 
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but amounted to only 2.62 points, which falls 
far short of the conventional levels of statistical 
significance. 

Focus of the Socratic effect. An attempt was 
made to answer the further question of where 
this Socratic effect is focused; that is, which of 
the person's initially inconsistent opinions are 
most likely to be changed in bringing about the 
greater consistency? The present data were an- 
alyzed to ascertain whether the amount of 
change was related to any of three variables: 
the logical status of the propositions, their ini- 
tial valence, or their ordinal position on the 
questionnaire in the first session. 

As regards logical status of the proposition, 
we tested whether the major premise, minor, 
and conclusion contributed differentially to the 
change towards consistency. All of the syllogisms 
used in this study were of the “Barbara” type. 
Hence the major premise contained the M-P 
terms; the minor, the S-M terms; and the con- 
clusion, the S-P terms (where S refers to the 
subject of the conclusion; P to its predicate; 
and M to the middle term). 

Logical status of the proposition did have a 
significant overall impact on amount of opinion 
change. As can be seen in Table 1, the major 
premise contributed most of the change towards 
increased consistency, the mean change being 
3.50 points on the 100-point scale. The conclu- 
sion showed the next greatest change towards 
consistency, a mean of 2.41 points; and the 
minor premise contributed least, in fact their 
mean change was slightly (1.71 points) away 
from consistency. These differences among the 
three means are significant at the .05 level by 
an overall analysis of variance. 

The overall significance of the variance 
among the three means is due primarily to the 
greater change towards consistency (averaging 
2.95 points) of the propositions with the P 
terms (namely, the major premises and the con- 
clusions) than of those without these P terms 
(namely the minor premises), which averaged 
a 1.71 point change away from consistency. The 
difference between the two classes of proposi- 
tions is significant at the .03 level. It appears 
to be explicable on the basis of initial distor- 
tion. In these "Barbara" syllogisms, the P terms 
tend to have the greatest emotional valence. 
Therefore, by the auxiliary hypothesis of wish- 
ful thinking, the propositions that include these 
emotionally-tinged terms would have been the 
most distorted originally. The minor premises, 
without the P terms, are typically statements of 


class inclusion ("Socrates is a man") and there- 
fore less likely to be distorted. Hence, the 
greater change towards consistency of the other 
propositions appear to be a consequence of the 
fact that the others had contributed more of 
the original inconsistency. 

Two other propositional characteristics were 
investigated for possible relationship to the 
amount of change in opinion produced by the 
Socratic method: initial valence and initial 
order of presentation. It was found that some- 
what more change towards consistency occurred 
on the highly valenced propositions—the 12 
most and the 12 least desirable propositions— 
than on the 24 moderately valenced ones. The 
direction of this outcome tends to reverse the 
"polarity" prediction (Osgood and Tannen- 
baum, 1955; Tannenbaum, 1956), but it does 
not approach the conventional level of statis- 
tical significance. Regarding initial order, it was 
found that somewhat more change toward con- 
sistency occurred on those propositions which, 
in the first session, appeared earlier on the ques- 
tionnaire than on those that appeared towards 
the end, but that this difference was not sta- 
tistically significant. The difference has been 
predicted on the assumption that the later ap- 
pearing propositions had already, on the first 
administration of the questionnaire, been 
brought into closer agreement with the opin- 
ions given on the propositions that had come 
earlier, so that less change remained to be 
made. 

A number of other hypotheses, not tested in 
this study, regarding propositional character- 
istics that affect amount of change in cases of 
mutual inconsistency could be tested. Opinions 
on more familiar issues may be more stable 
than those on less familiar. Opinions on issues 
that show greater cognitive complexity and 
greater articulation with the rest of the person's 
cognitive system [as defined, for example, by 
Zajonc's (1954) procedures] might participate 
less in the change towards consistency than 
more isolated and undifferentiated opinions. 
TTannenbaum's (1956) finding (that when differ- 
ently valenced sources and stands on issues are 
paired, the person's attitude on the issue shows 
less change than his attitude towards the source) 
is suggestive in this regard. Following Asch's 
(1952) thinking that persuasion involves a 
change of the issue being judged rather than 
change of judgment on the given issue, we 
might expect that in case of conflict, the great- 
est change would occur on the issues most easily 
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redefined. Another hypothesis is that the rela- 
tive proneness to readjustment of conflicting 
opinions is related to the bases for accepting 
the opinions. For example, following Kelman's 
(1956) terminology, we might conjecture that 
opinions based on internalization might be 
more prone to change, in case of internal con- 
flict, than those based on identification or on 
compliance. These latter need not be consistent 
with one's other beliefs, because the adherence 
to them is based on extrinsic considerations. 


Hypothesis Concerning Logical Repercussions 


The results just reported regarding the So- 
cratic effect hypotheses are all based on data in 
the no-communications conditions. The follow- 
ing results, which deal with the logical reper- 
cussions of persuasive messages are based on 
data from the conditions in which persuasive 
communications were received. The changes 
reported as due to the persuasive messages are 
computed from a baseline provided by the 
change from first to second session in the no- 
communication condition. Since the results 
concerning the Socratic effect indicate certain 
systematic changes in opinions occur from first 
to second session even in the absence of persua- 
sive messages, the message-induced changes must 
be computed from a control baseline that will 
eliminate those other changes and also any 
regression trends. 

Immediate postcommunication changes. 'The 
induced changes of opinion on the explicit tar- 
get issues (the premises) were considerable. Each 
message argued that the stand taken in one of 
the premises of each syllogism was true and 
that, therefore, the receiver should increase his 
subjective probability regarding the truth of 
that proposition. As can be seen in Table 2, 
the mean induced increase on the 16 explicit 
propositions immediately after the communica- 
tion was 17.56 points on the 100-point scale 
(p < .001). 

In order to maintain cognitive consistency, 
the Ss should therefore have increased their 
subjective probabilities regarding the truth of 
the conclusions. Even though neither conclu- 
sions nor the additional premises needed for 
their derivation had been adverted to in the 
communications, a considerable immediate post- 
communication increase was indeed produced 
on these conclusions. As can be seen in Table 2, 
there was an increase (as required for consis- 
tency) immediately after the messages on 14 of 
the 16 unmentioned conclusions, the mean in- 


TABLE 2. Changes in Subjective Probabilities 
on Explicit Issue and on Derived, Unmentioned 
Issues Immediately After and One Week after 
Persuasive Messages 


Discrep- 
ancies of 
Obtained 
Logically from 
Obtained Required Obtained Required 


Change Change Change Change 
on on Con- on Con- on Con- 
Premises clusions clusions clusions 
Changes from 17.56 11.57 5.96 —5.61 
before to 
immediately 
after 
Changes from —843 —6.69 —1.48 5.20 
immediately 
to one week 
after 


crease being 5.96 points. A mean change of this 
magnitude is significant at the .01 level. Hence, 
the results do indicate a consistency tendency 
sufficiently strong to cause a persuasive message 
to have an impact on logically related issues 
not explicitly mentioned in the communication. 

These obtained changes on the derivative 
issues are, however, less than the amount re- 
quired for complete logical consistency as de- 
termined by substituting the amount of the 
obtained change on the explicit premise in 
Equation 3. The mean required increase on the 
conclusions so computed was 11.57 points, while 
the actually obtained increase was only 5.96 (see 
Table 2), a difference significant beyond the .05 
level. Hence, persuasive messages, while they do 
produce changes on remote, unmentioned issues, 
produce less than the amount logically required, 
a finding that supports the hypothesis of a cer- 
tain amount of cognitive inertia. 

Delayed postcommunication changes. A con- 
siderable amount of the immediately induced 
persuasion on the explicit target issues had dis- 
sipated by the end of the week following the 
communication. As can be seen in Table 2, 
almost half of the immediately induced change 
on these explicit issues had dissipated one week 
later. The mean loss of 8.43 points was signifi- 
cant at the .02 level. On the other hand, the 
9.13 points of change remaining after one week 
was also significant (at the .01 level), indicating 
that there is still a significant amount of change 
of opinion on the explicit issue one week after 
the persuasive communication. 

When this loss on the premises is substituted 
in Equation $ the amount of the originally in- 
duced change on the unmentioned derived is- 
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sues that should logically have dissipated dur- 
ing the succeeding week was computed as 6.69 
points (see Table 2). The obtained loss during 
the interim, however, was only a negligible 1.48 
points, a gain of 5.20 points relative to the 
explicit premises for the week (p = .10). In fact, 
by the week after the persuasive communica- 
tion, the net mean change on the conclusions 
from their precommunication level was an in- 
crease of 4.47 points, which was almost as much 
as the 4.89 points logically required in view of 
the net changes on the premises. The trivial 
extent to which the change on conclusions fell 
short of the required amount one week after, 
as compared with the significant extent to which 
it fell short immediately after the communica- 
tion, gives some support to the hypothesis that 
the persuasive messages had a delayed-action 
effect on the derived issues. The net change on 
the derived issues did not actually increase with 
the passage of time, but it fell off less sharply 
than would have been expected on the basis of 
the usual rate of temporal decay of the imme- 
diately transmitted opinion change, suggesting 
that continued seepage of the impact on the 
explicit issue almost completely overcame the 
opposing decay effect on the derived, unmen- 
tioned issue, 

Comparison with previous studies. One 
methodological artifact in the present study, 
the use of a before-after design, may have aug- 
mented the amount of the logical repercussions 
of the persuasive messages. In order to obtain 
each S's precommunication opinions he was re- 
quired to fill out, one week prior to receipt of 
the persuasive messages, an opinion question- 
naire containing all the issues, including the 
auxiliary premises and conclusions not men- 
tioned in the message itself. It is possible that 
having stated his opinions on these related 
issues during the previous week may have some- 
what increased their saliency so that they were 
more affected by the message than would have 
been the case had they not been raised the 
previous week. 

On the other hand, the finding of logical 
repercussions in the present experiment is par- 
ticularly convincing in view of the fact that the 
method included a number of conditions that 
have been shown in previous studies to militate 
against logical repercussions. In the first place, 
not only was the conclusion itself unmentioned 
in the messages but also one of the two prem- 
ises necessary for its derivation was left unmen- 
tioned. Also, the $s employed were of a type, 


being below average in intelligence, that has 
been found (Cooper and Dinerman, 1951) to 
be relatively resistant to nonexplicit messages. 
Furthermore, no mention was made during the 
experiment of our interest in consistency, and 
the logically related propositions were widely 
interspersed with other propositions within the 
questionnaire. Finally the Ss gave their opinions 
on a scale with whose computational axioms 
they were unfamiliar, so that they could hardly 
have determined the extent of their inconsis- 
tencies even had they been interested in this 
matter. 

Hence, the negative indications reported in 
some previous studies for the hypothesis that 
persuasive communications have logical reper- 
cussions require comment. While more persua- 
sion has typically been found on a derived 
issue if that issue is explicitly mentioned rather 
than only implied, the data reported typically 
do show some change on the nonexplicit issue 
(Hovland, Lumsdaine, and Sheffield, 1949; Hov- 
land and Mandell, 1952). Also, the relative lack 
of effect on the derived issue is most evident 
in the immediate postcommunication periods. 
Hovland, Lumsdaine, and Sheffield (1949, pp. 
70-71) found that the “Why We Fight" films 
did have an impact on more general attitudes 
after the passage of a greater amount of time, 
and have postulated that there may be need for 
a “sinking in” period before these derivative 
effects are felt. Similarly, Katz, Stotland, 
Patchen, and Jochen (1957) report that giving 
Ss case history material regarding the roots of 
prejudice causes little immediate effect on their 
own prejudice, but after the passage of several 
weeks, the extrapolation of the intellectual 
knowledge to their opinions does occur. Fur- 
thermore, the magnitude of the logically re- 
quired change on the derived issue is, as shown 
by Equation 3, less than that on the explicit 
issue. This logically consistent attenuation of 
the persuasive impact on derived issues due to 
a compounding of contingencies becomes more 
severe as the derived issue becomes more ten- 
uously related to the explicit issue. For exam- 
ple, the more premises beside the explicit one 
needed for its derivation, the less the logically 
required change on the derived conclusion. 
Where the derived issues are as remote from 
those explicitly dealt with as was the case, for 
example, in the Hovland, Lumsdaine, and Shef- 
field “Why We Fight” studies (1949), very little 
change on the derived issues would be logically 
required even in a perfectly consistent S. 
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However, while only a small change in opin- 
ion is logically required on the derived issue, 
the amount actually occurring is even less. Peo- 
ple's opinions seem to follow a “loose-link” 
model such that a considerable change can be 
induced on one cognition before any pull is 
exerted on related cognitions. This effect, how- 
ever, is at least partially corrected by the con- 
tinued seepage of change from the explicit 
issue to the derived one as time passes after the 
receipt of the communication. 

The question of conscious awareness arises 
in connection with the psychodynamics of the 
present effect: to what extent is it necessary 
that the person be consciously aware of the 
inconsistencies among his beliefs in order that 
the effects occur? In the present study several 
procedures were employed in an attempt to 
minimize the degree of conscious awareness. 
(For example, no mention was made to the Ss 
that their consistency was being assessed; the 
related propositions were widely separated from 
one another in the questionnaire; a quantita- 
tive expression of opinion was required and the 
assessment of inconsistency involved a mathe- 
matical model unfamiliar to the Ss.) Hence, 
the present findings indicate that the effects 
occur even under conditions that attempt to 
avoid any suggestion of inconsistency. 

The Ss’ level of awareness of inconsistency 
was not systematically varied within the present 
experiment, but it is interesting to speculate on 
the results of such a manipulation. Increasing 
the person’s awareness of his inconsistency by, 
say, explicitly pointing it out to him might pro- 
duce greater changes towards consistency. On 
the other hand, such a procedure could well 
have the opposite effect by provoking the per- 
son's hostility or defensiveness, and so result, 
not in a greater change towards consistency, 
but in some other reaction such as the rejec- 
tion of consistency, with Carlyle, as the hob- 
goblin of little minds; or in the repression of 
some of the conflicting issues. 


SUMMARY 


Two sets of hypotheses were derived from 
the postulate that people tend to maintain con- 
sistency among their opinions on logically re- 
lated issues, First, it was hypothesized that 
opinion change would be effected by the So- 
cratic method of simply asking the person to 
State contiguously his opinions on logically re- 
lated issues, thus bringing the tendency towards 


consistency to bear on any logical discrepancies 
among these opinions. Secondly, it was hypoth- 
esized that a persuasive communication di- 
rected at a person's opinion on some explicit 
issue tends to change also his opinions on log- 
ically related (even if unmentioned) derivative 
issues in a consistent direction. A procedure 
for quantitatively measuring the degree of log- 
ical consistency among opinions on related 
issues was described. 

The Ss were 120 college students participat- 
ing in three experimental sessions. In the first, 
before-communication opinions regarding the 
subjective probabilities and desirabilities of 16 
sets of syllogistically related propositions were 
obtained. In the second session, 16 messages, 
each arguing for the truth of one premise of 
each syllogism, were communicated, and the 
immediate postcommunication opinion of the 
truth of each of the 48 propositions was ob- 
tained. Each $ served in communication condi- 
tions with respect to some issues, and control, 
no-communication conditions with respect to 
others. In the third session, one week later, the 
delayed postcommunication opinions on the 
truth of the 48 propositions were obtained. 

With respect to the Socratic effect hypothesis 
a significant amount of inconsistency attribut- 
able to wishful thinking was found, as postu- 
lated on the first elicitation of the opinions 
(p < .01). One week later, on a second elicita- 
tion of the opinions, there was a significant 
(p — .02) decline in the amount of this wishful 
thinking, indicating that the Socratic method 
did indeed produce the predicted change to- 
wards consistency. Several propositional vari- 
ables were examined for possible relationship 
to the amount of change towards consistency. 
It was found that propositions with the predi- 
cate term (namely, major premises and conclu- 
sions) contributed significantly more (p — .03) 
of the change towards consistency than those 
without the predicate term. 

With respect to the hypothesis concerning 
logical repercussions, it was found that imme- 
diately after the communication there was an 
induced opinion gain not only on the explicit 
target issues (p < .001) but also the derived is- 
sues (p — .02), even though neither the derived 
issues themselves nor even the auxiliary prem- 
ises needed for their derivation were mentioned 
in the communication. This indirectly induced 
gain on the derived issues fell short, however, 
of the full amount needed for complete logical 
consistency (p < .05). One weck later, there was 


eS SS 


COGNITIVE CONSISTENCY AND ATTITUDE CHANGE 


365 


some suggestion that the impact of the persua- shown only 52% of the logically required 
sive message had continued to seep down to amount of change, while by one week after, this 
the derived issues: immediately after the mes- impact had risen to 91% of the required amount 


sages the opinions on the derived issues had (p = .10). 
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B. Behavior Theories 


40. Attitude Formation: The Development of a 
Color-Preference Response through 


Mediated Generalization 


BERNICE EISMAN LOTT 


The present investigation of attitude develop- 
ment has had as its major stimulus Doob's 1947 
discussion (7) and reinterpretation of the con- 
cept. His placement of attitude within the 
framework of behavior theory and his emphasis 
on attitude as a learned response to definable 
stimuli appears to be a more research oriented 
and potentially fruitful approach to the prob- 
lem than has been taken in the past. An equally 
important source of stimulation for this study 
has come from the work on mediated and se- 
mantic generalization, particularly from the 
work of Birge (2), Cofer and Foley and asso- 
ciates (3, 4, 5, 8, 9), Keller (11), Razran (13, 14), 
and Reiss (15, 16). 

An attitude is here defined, as suggested by 
Doob, as a learned implicit anticipatory re- 
sponse having both cue and drive properties. 
An anticipatory response is one which origi- 
nally came after a rewarded response but as a 
result of being most closely associated with the 
reward moves forward in the behavior sequence. 
This results in its "anticipatory" occurrence 
directly to the external stimulus before its orig- 
inal time in the response series. As a response 
with stimulus properties, an attitude falls into 


* Reprinted from Journal of Abnormal and Social 
Psychology, 1955, 50, 321-826, with permission of 
the author and the American Psychological Associ- 
ation, 
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the class of "pure stimulus acts" defined by 
Hull (10) as responses whose major function is 
to serve as stimuli for other acts. It follows, 
therefore, that once such an implicit response- 
produced stimulus (r-s) has been established, it 
can be conditioned to evoke overt behavior. In 
this way an attitude acts as a mediator of re- 
sponses. It is these responses, mediated by an 
attitude, which are measured by attitude ques- 
tionnaires in attitude experiments, etc. 

The property of drive value ascribed to an 
attitude is derived from the fact that an atti- 
tudinal response is assumed to produce strong 
stimulation. Any strong, persistent stimulus is 
considered by Miller and Dollard (6, p. 30) to 
constitute a drive. Reduction of the strong 
stimulus is drive reducing and therefore rein- 
forcing. It is expected, therefore, that any overt 
response which leads to a significant reduction 
of the strong stimulus will be learned. Ascrib- 
ing drive value to attitudes distinguishes them 
from other implicit, anticipatory response-pro- 
duced stimuli and is in line with traditional 
views. Attitudes, wrote Allport (1), "both moti- 
vate and guide; they supply both drive and 
direction." Sherif and Cantril (17, p. 300) offer, 
as one criterion of an attitude, its affective 
property. Krech, too, speaks of attitudes as mo- 
tivational, thus distinguishing them from be- 
liefs (12, p. 152). 
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STATEMENT OF THE PROBLEM 


The purpose of the present study has been 
to demonstrate experimentally that attitude 
formation is amenable to laboratory observa- 
tion and control, and can be adequately con- 
ceptualized and explained by reinforcement 
learning theory. The general hypothesis of this 
study can be formally stated as follows: A posi- 
tive preference or attitude toward an object 
with which an individual has had only neutral 
experience can be developed through mediated 
generalization. In more operational terms, it is 
predicted that if individuals are taught to at- 
tach the same label to two stimuli which differ 
from one another in all other relevant ways, 
and if these individuals subsequently have re- 
warding experience with one of these stimuli, 
then a positive preference for the other stimu- 
lus, as well as for stimuli resembling the latter, 
will be demonstrable. This type of learning 
experience follows the general paradigm for 
mediated generalization. In this study the sub- 
jects develop a positive response for a color 
after being subjected to such a series of learn- 
ing experiences. 

To test this general hypothesis four different 
test situations, in which a color preference 
could be demonstrated, were employed. These 
Situations, differing in context, complexity, and 
Social significance, will be described in detail 
below. 


METHOD 


Apparatus and Subjects 


A wooden stage, 20.5 in. high, 30 in. wide, 
and 19 in. deep, with a raised, slanted platform, 
at an angle of about 35 degrees from the hori- 
zontal, was constructed and used for the pre- 
Sentation of various stimulus objects to the 
Subjects. An unbleached muslin curtain could be 
pulled in front or away from the platform in 
order to hide or expose objects on it. Three 
small depressions about 8 in. apart were drilled 
into the platform to serve as receptacles for 
marbles which were used as rewards during 
certain portions of the training. 

The following stimulus objects were used: 
(a) three wooden geometrical blocks, each 
painted white and .75 in. thick, triangular, cir- 
cular, and square, respectively; (b) three thin 
rectangular blocks, each 2.5 in. by 6 in. and .75 
in. thick, painted green, yellow, and black, re- 
spectively; (c) three metal jar tops, 2.5 in. in 


diameter, also painted green, yellow, and black, 
respectively; (d) three glasses, 3.5 in. high, 
painted green, yellow, and black, respectively; 
(e) three thick rectangular blocks, 1.5 in. by 1.5 
in. by 4.5 in., painted green, yellow, and black, 
respectively; (f) three white wooden “nonsense” 
blocks, .75 in. thick, and of three different non- 
sense shapes. 

Eighty-one subjects (Ss) were used, 41 boys 
and 40 girls, ranging in age from 5 years, 10 
months to 8 years, 10 months. All Ss were at- 
tending an elementary school operated by the 
Department of Education of U.C.L.A. All mem- 
bers of each of the classes used in this study 
were utilized by the experimenter (E), i.e., there 
was no arbitrary selection of Ss from within any 
class. The Ss were told that E had a new game 
she wanted to try out and that she would play 
with each of the children in a class, one at a 
time. Owing to the nature of the school, all 
children in it were accustomed to observation 
by strangers and many of them were also accus- 
tomed to leaving their classes for some time for 
testing or other “‘play’’-research purposes. 


Procedure 


The 81 Ss were randomly divided into four 
groups with 9 Ss in Group I, 18 Ss in Group II, 
and 27 Ss each in Groups III and IV. One third 
of the Ss in each group was randomly selected 
to be trained positively for each of the three 
colors, yellow, green, and black. The E worked 
with one S at a time. Since the name of each S$ 
had been assigned to an experimental group 
prior to the start of the experiment, the order 
in which $s came to E was not fixed, and either 
the teacher or the last $ chose the next S. 

Part 1. Name learning for the geometrical 
blocks. Each S sat facing E and the front of the 
stage. The Ss in all four groups were treated 
alike during this portion of the training. The 
three geometrical blocks were placed on the 
stage by E, and S was told that all three had 
names which $ had to discover, the names being 
"egg" (for the triangle), "car" (for the circle), 
and "shoe" (for the square). Each $ was shown 
one block at a time in a predetermined random 
order which was the same for all Ss. When a 
block was presented, $ guessed at its name and 
E told S if he was right or wrong. The S$ was 
given as many trials as was necessary in order 
for him successfully to identify each of the three 
blocks five times in succession. Each $ was told 
that learning the names of the blocks was neces- 
sary in order to proceed to the next part of 
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the game during which he could win some 
marbles. 

Part 2. Reward series. Here, too, Ss in all 
four groups were treated alike. The geometrical 
blocks were again placed on the platform and 
S was told that E would hide a marble under 
one of them when the curtains were closed and 
then ask $ to guess where the marble was. The 
S was instructed to point to the block under 
which he thought the marble was hidden and 
to call the block by its name, the name learned 
in Part 1. When placing a marble under the 
proper block, E was careful to make extraneous 
noise so that sound would not serve as a reli- 
able cue with respect to the hiding place of 
the marble. 


A marble was always hidden under the same 
block for any given S. For those who were to be 
trained to prefer green, the marble was always 
placed under the triangle ("egg"); for those to 
be trained for yellow, the marble was placed 
under the circle ("car"); and for those to be 
trained for black, the marble was always placed 
under the square ("shoe"). The relative posi- 
tions of the blocks were shifted per trial accord- 
ing to a random, predetermined order which 
was constant for all Ss. 

Each time a marble was hidden S was per- 
mitted only one guess as to its location. If S 
guessed correctly, he kept the marble; if he 
guessed incorrectly, E closed the curtains and 
hid the marble again, under the same block 
but in a new position with respect to the other 
two. The reward series ended when $ had made 
six consecutive correct guesses. If the criterion 
was not reached by the fourteenth trial, a cor- 
rection procedure was instituted, ie, when S 
guessed incorrectly, E lifted the correct block 
and showed $ where the marble was. 


The purpose of this part of the training was 
to provide differential reinforcement for one of 
the three names ("egg," “car,” or "shoe") which 
S had learned in Part 1. This is the only por- 
tion of the training in which such differential 
reinforcement could be obtained. 


Part 3. Name learning for the thin rectan- 
gular blocks. Immediately after the reward se- 
ties, E presented the three colored rectangular 
blocks, instructing $ that one was called "egg" 
(the green block), one was called "car" (the yel- 
low block), and one was called "shoe" (the 
black block). The $ was again asked to discover 
which block had what name. The procedure 
followed here was identical to that followed in 


Part 1 and again Ss in all four groups were 
treated alike. 

Part 4. Review series. Groups I and II. Im- 
mediately following Part 3, 5s in both of these 
groups were tested for recall of the names of 
the geometrical blocks and of the colored rec- 
tangles and for recall of the marble game (re- 
ward series). 

Groups III and IV. For Ss in these groups a 
longer and more elaborate review was given in 
order to introduce more reward and color- 
naming experience. This was done because the 
results of a preliminary investigation had led 
to the conclusion that owing to the nature of 
the test situations (to be described below) given 
$s in Groups III and IV, additional reward and 
naming experience would be necessary for me- 
diated generalization to be demonstrable. 

The E presented three new stimuli, the col- 
ored jar tops, and asked $ to guess what their 
names were. The S received a marble each time 
he correctly named all three stimuli. When $ 
reached the criterion of three correct identifica- 
tions, the colored glasses were presented and 
the same procedure was followed. The E next 
presented the three thick rectangles and asked 
S to guess their names. Correct identifications 
were calling the green objects “egg,” the yellow 
ones "car," and the black ones “shoe.” 

The Ss were then reviewed on the marble 
game (reward series). Then one jar top, one 
glass, and one rectangle, each of a different 
color, were presented. All three of the possible 
combinations were presented successively and S 
was asked to name the objects. No marble re- 
wards were given. 

Part 5. Test for mediated generalization and 
color preference. Here the major differential 
treatment for the four experimental groups was 
introduced. The tests were given immediately 
after the review. 

Group I. The Ss in this group were presented 
with the three thin rectangular blocks used in 
Part 3 and told, "Now we'll play the marble 
game with these. You tell me where you think 
the marble is. Don't forget to tell me the name 
of the block you think the marble is under." 
With the curtains hiding the platform, E placed 
a marble under each block. Only one trial was 
given, and after $ made his choice he was told 
that there would be no more games. A right 
choice was picking that colored rectangle which 
had the same name as the geometrical block 
under which $ had consistently found marbles 
in Part 2. 
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Group II. The Ss in this group were also told 
that they were going to play the marble game 
again. The curtains were drawn, but instead of 
placing the three thin rectangles on the plat- 
form, as was done with Group I, E substituted 
three new stimuli, the colored jar tops, with 
which Ss in Group II had had no prior experi- 
mental experience. A marble was placed under 
each jar top. When the curtains were opened, 
S was told to choose that jar top under which 
he thought a marble was hidden and to give its 
name ("egg," "car," "shoe"). The E did not tell 
$s that these were the names of the new ob- 
jects, and Ss had had no experience in so nam- 
ing them. Only one choice trial was given. A 
correct choice was picking that jar top which 
was the same color as the rectangle which S, in 
Part 3, had learned had the same name as the 
geometrical block under which marbles had 
consistently been found in Part 2. 

Group III. The Ss in this group were pre- 
sented with three new stimuli, the white "non- 
sense" blocks, and instructed that one was called 
"green," one "black," and one "yellow." The $ 
was told that he had to discover which block 
had what name. The same name-learning pro- 
cedure as was used in Parts 1 and 8 was utilized 
here, and again a criterion of five successive 
correct identifications of each block was em- 
ployed. After § reached the criterion, he was 
asked, "If you could take one of these blocks 
home, which would you pick?" 

The color name learned for each of the three 
"nonsense" blocks was varied from $ to $ in 
order to correct for possible preference for one 
of the blocks because of its shape. Thus, for 
example, of three subjects, all trained to prefer 
black ("shoe") by having been given differential 
reinforcement for the square block ("shoe") in 
Part 2, each learned to call a different "non- 
sense" block "black" in this test situation. Each 
of the three Ss was expected to choose that 
block which he had learned to call "black." 

Group IV. The E told the Ss in this group a 
story about a pair of twins who had just moved 
into a new neighborhood. During the course of 
the story the twins meet three groups of chil- 
dren, a green, a yellow, and a black group. The 
Play activities described for the three groups 
are almost identical. The climax of the story is 
reached when the twins decide to have a birth- 
day party and their parents insist that they can 
invite only one of the groups of their new 
friends. ‘The twins must choose only one of the 
groups for their party. Each $ was asked by E 


to choose the group he would invite. A correct 
choice was picking that group having the same 
color as the rectangular block which $ had 
learned, in Part 3, had the same name as the 
geometrical block under which he had consis- 
tently found marbles in Part 2. 


RESULTS 


"Table 1 summarizes the generalization results 
obtained from S’s responses in their respective 


TABLE 1. Generalization Results: Responses 
of Subjects in All Groups in Their Respective 
Test Situations 


Number Number 
of Ss of Ss 
Making Making 
Correct Incorrect 
Group N Choice Choice x? p* 
I 9 8 (8) 1( 6) — < .002** 
I 18 14 (6) 4 (12) 16.00 < 001 


ul 27 16 (9) 
IV 27 17 (9) 


11 (18) 8.17 <.01 
10 (18) 10.67 <.01 


*Probability that the difference between obtained and ex- 
pected frequencies could have occurred by chance alone. 
**Since the theoretical frequencies and total N of Group I 
are too low for the reliable use of x?, the probability figure 
shown was obtained by evaluating the data by means of 
the binomial expansion (both tails of the distribution con- 
sidered). 


test situations. The numbers in parentheses 
represent expected frequencies. It was expected 
that since each $ had to choose among three 
colors or three colored objects in the test situa- 
tion, one third of the group could choose the 
correct (ie. trained for) color and two thirds 
of the group could choose the incorrect color 
by chance alone. A significant deviation from 
these proportions in the direction of more than 
one third Yeses and less than two thirds Noes 
refutes the null hypothesis and lends support 
to the experimental hypothesis. 

It can be seen from Table 1 that the per- 
formance of all groups verified the experimental 
hypothesis. The null hypothesis of no signifi- 
cant difference between obtained and theoret- 
ical frequencies can be safely rejected, and the 
training given each group is thus shown to be 
adequate for the generalization of a positive 
choice response from one stimulus to a previ- 
ously neutral one. 

A chi-square test of independence was also 
run on the data shown in Table 1 to sce if 
there were any significant differences among 
the four groups in the success-failure ratio. No 
such differences were found (x? = 8.0; p.» 
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Figure 1. Conditions for and results of training for Ss in Group IV. S stands for 
stimulus of either geometric blocks (g) or colors (c); R stands for overt responses 
of either naming (n) or choosing (ch); r stands for implicit responses and s stands 
for stimuli produced by these implicit responses. 


A test between Groups I and II combined and 
Groups III and IV combined also yielded in- 
significant results, although there is an indica- 
tion that the more difficult task required of Ss 
in these latter two groups served to lower the 
number of correct responses (12 = 3.37; p> 
:05).1 


GENERAL DISCUSSION AND CONCLUSIONS 


The results of this experiment have shown 
that conditions which allow for mediated gen- 
eralization, and which, in certain cases, offer 
Ss additional opportunities for reward and 
“labeling” experience, are sufficient to produce 
à positive response toward a previously neutral 
color with which Ss had had no differential 
reward experience. From this positive response, 
a favorable “attitude” towards this color is in- 
ferred. In addition, this favorable attitude has 
been demonstrated in more than one type of 
situation. 

Space limitations do not permit a full pre- 
sentation of the analysis of the results of this 
study in terms of learning theory concepts and 


1 The influence of age and sex on generalization 
in this experimental situation was tested by means 
of chi square. No significant differences were found 
to exist between Ss above and below the median 
age of their respective groups with respect to suc- 
cess and failure on the generalization tests. Simi- 
larly, no significant difference in the success-failure 
ratio was found to hold between the sexes. 


principles. One illustration of the kind of anal- 
ysis performed will have to suffice. 

Figure 1 represents the conditions for and 
results of training for Ss in Group IV. Stage A 
represents Parts 1 and 3 of the experimental 
training, ie. the name-learning sessions; Stage 
B represents Part 2, the reward session; and 
Stage C represents Part 5, the test situation. 
Stage A might have been better represented by 
three diagrams—one for each geometrical block 
and colored rectangle pair to which S learned 
to give the same name. For convenience, only 
one diagram, however, is used to represent all 
three. In stage B only one S, is referred to, 
depending upon which geometrical block a 
particular $ was consistently rewarded for choos- 
ing. 

In stage C, the test situation, color names of 
three groups of children in a Story were pre- 
sented as oral stimuli to the Ss by E. S, repre- 
sents these color-name stimuli. Each of the 
colorname stimuli can evoke a specific T,-5, 
because of the relationship shown in Figure 1 
under the heading "conditioning assumed to 
have occurred prior to the experiment." It is 
assumed that the Ss, outside the experimental 
situation, have learned to make this same re- 
sponse to both $, (a visual color stimulus) and 
Sen (an auditory color stimulus, ie, a color 
name). This same response is R,, (naming the 
color, for example, "yellow". If we now invoke 
the principle of mediated generalization we can 
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say that any response subsequently conditioned 
to S, will also be evocable by S,,. In stage A 
such a new response was conditioned to S,, 
namely the “egg,” "car," or "shoe" response, 
R,. As a result, when S, is presented in stage 
C, it can evoke the implicit and anticipatory 
component of R,, namely r,-s,. 

Once we have thus accounted for the evoca- 
tion of r,-s,, we can see how this can evoke the 
anticipatory component of R,,, namely 74:54, 
the "attitude." This stems from the condition- 
ing which took place in stage B and from the 
anticipatory nature of 7,)-s,,. In this situation 
we see that r,,-5,, evokes, not the old Rea of 
stage B, but a slightly different choice response 
(Rex). Both choice responses can be understood 
as being part of a habit family associated with 
the same stimulus, rj-5,,, the “attitude.” The 
evocation of the correct choice response, R,, 
cannot be fully explained without assuming, as 
we have done, its evocation by its implicit com- 
ponent, the attitudinal response-produced stim- 
ulus orson 

There is one part of our diagram which has 
thus far been ignored. We have said that in 
stage B r,-s, becomes conditioned to R,, and 
that similarly r,,-5,, becomes conditioned to R,. 
That this is indeed the case was demonstrated 
by the fact that many Ss, when choosing the 
colored group of their choice also used an 
"egg," “car,” or "shoe" name to refer to it. 
Stage C, therefore, has been diagrammed to 
include the evocation of R, by Ten-Scn- 

Analyses of the conditions and results of 
training for Ss in Groups I, II, and III involve 
the application of similar principles. 

It is hoped that the present study represents 
à step in the direction of experimentally vali- 
dating certain principles of learning theory on 
à human level in more or less uniquely human 
situations. The author has shown that at least 
one of these principles (mediated generaliza- 
tion) can be used to predict a certain type of 
human behavior, preference for a class of stimuli 
with which individuals have had only neutral 
experience. This type of behavior is common 
and shows itself most dramatically in the exhi- 
bition of attitudes toward social or ethnic 
groups with which individuals have had little 
or no direct experience. That such attitudes 
may be directly taught by parents or other 
“teachers” and maintained by group reinforce- 
ment is one possibility. That they may also be 
formed in the indirect manner experimentally 
demonstrated by this study is another. It is not 


too big a step to assume that if children can 
learn to prefer the color yellow, for example, 
over the colors green and black as a result of 
the type of training given them in this investi- 
gation, they can also, as a result of similar train- 
ing, learn to prefer people called Englishmen 
over those called Poles and Italians, even though 
they have had no differential reward or punish- 
ment experience with any of those three na- 
tional groups. The essential requirement for 
this learning is that individuals should learn to 
attach the same labels to the word “English- 
man” as to stimuli with which they have re- 
warding experience, and to attach the same 
labels to the words "Pole" or "Italian" as to 
stimuli with which they have had neutral or 
punishing experience. 


SUMMARY 


Eighty-one elementary school children of 
grades 1, 2, and 3 were trained to label a white 
triangular block and a green rectangular block 
"egg," to label a white square block and a black 
rectangular block "shoe," and to label a white 
circular block and a yellow rectangular block 
"car." In a subsequent situation all Ss were 
rewarded for choosing one particular geomet- 
rical white block. The Ss in Group I were then 
tested for mediated generalization by being 
asked to choose which of the colored rectangles 
they thought would bring a reward. The Ss in 
Group II were presented with three new, i.e., 
previously unexperienced, color stimuli and 
asked to guess which one would bring a reward. 
The Ss in Group III learned to label three 
different white "nonsense" blocks "yellow," 
"black," and "green," respectively, and then 
were asked to choose the block they would like 
to take home. The Ss in Group IV were told a 
story about three groups of children, a yellow, 
a black, and a green group, and then asked, 
which group they would invite to a hypothetical 
birthday party. 

For each group it was predicted that Ss would 
choose that stimulus in the test having the same 
actual color or color name as the colored rec- 
tangle which they had learned had the same 
"egg," "car," or "shoe" name as the geometrical 
block for the choice of which they had previ- 
ously been rewarded. These predictions were 
verified. Significantly more Ss in each group 
made the correct choices in the test than would 
have been expected by chance. 

The investigation was designed to test the 
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general hypothesis that a color-preference re- 
sponse (positive attitude) could be developed 
through mediated generalization, and that this 
preference could be demonstrated in four situ- 
ations differing in context, complexity, and 
social significance. An analysis, in terms of 
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41. An Outline of an Integrated Learning Theory of 
Attitude Formation and Function 


ARTHUR W. STAATS 


The history of psychology is a history of sepa- 
ratism, of clashes of men and theories. There 
have been many dimensions along which the 
field has split; for example, learning versus 
cognitive approaches. However, even within 
the general field of learning, there has been 
separatism. Even as it has been arrayed against 
other orientations, major efforts within learning 
have been expended in developing and main- 
taining separate experimental methods, sepa- 
rate general (philosophical) methodologies, and 
separate terminologies. 

I see this as being very disadvantageous; it 
is a course that has prevented the development 
of a believable learning theory of human be- 
havior. Simplistic learning approaches to human 
behavior, although frequently productive in 
their own research realms, have remained frag- 


mented and vulnerable to criticism in more . 


general considerations—it has been relatively 
easy to pick an example of behavior that the 
isolated approach could not handle. Conse- 
quently, I have attempted to integrate the basic 
principles of learning (classical and instrumental 
conditioning), the experimental findings from 
various areas of human learning experimenta- 
tion, and the observations of the social and 
behavioral sciences, to constitute a comprehen- 
sive learning theory of human behavior (see 
Staats, 1961, 19642, 1966, in press; Staats 
and Staats, 1963). This has involved an analysis 
of what we consider to be attitudes, some of 
the dimensions of which are presented here. 
First, a theory of attitudes should include the 
study not only of attitude formation, but also 
of attitude function. The former, nevertheless, 
is usually the focus of the study of attitudes, 
while the latter is taken for granted. As this 


* Prepared especially for this book. 


outline will suggest, however, an analysis of both 
aspects of attitudes is more comprehensive, the 
analysis may be used to indicate how classical 
and instrumental conditioning principles are 
closely intertwined in the study of human be- 
havior, and this in turn enables the study of 
attitudes to be interrelated with the considera- 
tions of human motivation relevant to the other 
social and behavioral sciences. 

To begin, it may be suggested that what I 
mean by an attitude is an "emotional" response 
to a stimulus—usually a social stimulus or a 
stimulus that has social significance, although 
this is only an artificial demarcation. Emotional 
responses come to be elicited by many stimuli, 
social and otherwise, in a process of classical 
conditioning. Thus, if a new stimulus is paired 
with a stimulus that elicits an emotional re- 
sponse, the new stimulus will come to do so 
also. This can occur with negative emotional 
responses, elicited by aversive stimuli, as well 
as with positive emotional responses, elicited by 
pleasant (or rewarding) stimuli. Once a stimulus 
has come through classical conditioning to elicit 
an emotional response, it can "transfer" the 
response to a new stimulus with which it is 
paired. This process is called higher-order con- 
ditioning. 

Thus classical conditioning may be said to 
account for the formation of attitudes, but it is 
still necessary to explicate the principles in- 
volved in attitude function—the reason why 
attitudes are so important to man's adjustment. 
It may be suggested, in making an integrated 
analysis, that the one process of classical con- 
ditioning actually endows the conditioned stim- 
ulus with two properties, or functions. 

The rationale may be summarized as follows. 
Stimuli that elicit emotional responses in addi- 
tion have another stimulus function. Although 
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we consider the functions separately in labora- 
tory study, the “emotional” stimuli that are used 
to elicit a response in the process of classical 
conditioning also have a function in an instru- 
mental conditioning process. Thus, for example, 
food as a stimulus elicits the salivary response 
and other emotional responses, but in addition 
it will serve another function. When food is 
presented following an instrumental response, 
that response will occur more frequently—it 
will be instrumentally conditioned. Food, or 
sex stimulation, or some other positive emo- 
tional stimulus, will function as a reinforcer or 
reward, containing the power to effect instru- 
mental behavior in the ways that rewards do. 
Electric shock, or any painful stimulus, may be 
seen as an example of a negative emotional 
stimulus that also has an instrumental function. 
Electric shock will elicit emotional responses, 
but in addition can be used as a negative re- 
inforcer to suppress an instrumental behavior 
on which it is contingent (punishment), or to 
strengthen behaviors that escape from the 
shock. 

Now, it has been suggested above that atti- 
tudinal stimuli are conditioned stimuli that 
through classical conditioning come to elicit 
emotional responses. It may also be suggested 
that in the process of classical conditioning 
both functions are “transferred” to the condi- 
tioned stimulus. That is, it would be expected 
that the conditioned attitudinal stimulus should 
also be capable of influencing the instrumental 
behaviors of the individual, as well as elicit an 
emotional response in the individual. 

Let us take an extended example by which 
to illustrate these two aspects of attitude forma- 
tion and function. Let us say that the word 
DANGEROUS has been paired with painful 
stimuli that elicit emotional responses. The 
word should come as a 0S through this to elicit 
a conditioned emotional response. I and my as- 
sociates (Staats et al., 1962) have shown that this 
emotional or attitudinal conditioning does in- 
deed take place. That is, a word was paired 
with aversive stimuli, and the word came to 
elicit an emotional response (GSR). Further- 
more, the subjects who were conditioned rated 
the word as unpleasant to an extent that was 
related to the extent of their emotional condi- 
tioning. These findings, which have been repli- 
cated by Maltzman et al. (1965), substantiate the 
analysis and also indicate that ratings can be 
used to index emotional, or attitudinal, con- 
ditioning. 


To continue with the example, however, the 
word DANGEROUS should, after pairing with 
aversive stimuli, be capable of making other 
stimuli into attitudinal stimuli, according to 
the principle of higher-order classical condition- 
ing. I and my associates have shown this in a 
number of studies (including the article that 
follows). That is, if the word DANGEROUS 
was paired with a new (social) stimulus, the new 
stimulus would be expected to come also to 
elicit the emotional response. To say that NE- 
GROES ARE DANGEROUS would condition 
the negative emotional response elicited by 
DANGEROUS to the word NEGROES. This 
would be a process of higher-order conditioning. 

Both this original conditioning and the 
higher-order conditioning have concerned only 
attitude formation. Following the preceding 
analysis, however, it would be expected that 
the word DANGEROUS, or the word NE- 
GROES, would as a result of the conditioning 
also be capable of effecting the individual's in- 
strumental behaviors. That is, the individual 
would be expected, for example, to learn escape 
behaviors that would take him away from the 
word DANGEROUS. He would avoid a bridge 
labeled DANGEROUS and so on. In the social 
area, following the example, he would also 
avoid people labeled as NEGROES. This learn- 
ing would follow the principles of instrumental 
conditioning. 

The experiment that is to be reprinted in 
the next article uses my experimental-theo- 
retical method for the study of higher-order 
conditioning of attitudinal responses. Words 
that already elicit emotional responses are used 
to condition emotional responses to new words. 
The indication of the emotional (attitudinal) 
meaning of the words is obtained from ratings 
of the pleasantness or unpleasantness of the 
words, Although this study deals only with the 
formation of attitudes toward socially signifi- 
cant stimuli, it should be noted that the same 
attitudinal words would be expected to have 
also an instrumental conditioning function. 
This analysis (see Staats, 1964b) has already 
been tested in a study in which words that had 
been rated as pleasant or unpleasant, that is, 
positive and negative emotional words, were 
presented contingent on an instrumental re- 
sponse to assess their conditioning effect on the 
response. Three groups of subjects were used: 
for one group one of two alternative motor 
responses was followed on each occasion by a 
positive emotional word (like those used in the 
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next study) such as HOLIDAY, SUPPER, 
BLOSSOM, VACATION, LAUGHTER; for 
another group negative emotional words were 
used such as BITTER, UGLY, FAT, HUN- 
GER, LOST; for a third group words of neu- 
tral meaning were used such as BOX, WITH, 
TWELVE. Positive attitudinal words presented 
in a response-contingent manner strengthened 
the instrumental response, negative attitudinal 
words weakened the response, the neutral words 
had an in-between effect. The results showed 
that words that are °Ss and elicit an emotional 
or attitudinal response, which can thus be used 
to condition emotional meaning to new stimuli, 
will also function as reinforcers in the learning 
of instrumental behaviors (Finley and Staats, in 
press). 

The analysis and the various experimental 
results have important implications for under- 
standing a powerful function of language in 
general, as well as for understanding the func- 
tions and the development of attitudes. More- 
over, the analysis and findings are also relevant 
to attitude measurement—especially to rating 
procedures such as the semantic differential. 
Not only do such ratings indicate the attitude 
responses to stimuli, but in addition they indi- 
cate the reinforcement value of the stimuli—the 
extent to which such stimuli will condition and 
maintain behavior according to the principles 
of instrumental conditioning. It may be sug- 
gested that one of the important focuses of the 
study of attitudes must become the investiga- 
tion of these principles by which attitude 
stimuli function. i 

In addition, showing the relationship of 
classical conditioning principles to the develop- 
ment of reinforcing, or attitudinal, stimuli has 
a great deal of significance in another way. 
That is, the consequent integration of the prin- 
ciples of reinforcement and the extension of 
the consideration of attitudes into the realm 
of instrumental behaviors brings a large area of 
significant human behavior into consideration 
by the analysis. This development revolves 
around the conception that what we consider 
to be attitude stimuli are a subclass of the 
larger class of reinforcing stimuli. Thus, in its 
broadest sense, the study of attitudes—and 
other reinforcing stimuli commonly described 
as values in social psychology and sociology, 
needs and interests in personality theory, evalu- 
ative meaning in experimental psychology, and 
fetishes and urges in abnormal psychology, to 
mention a few examples—is the study of hu- 


man motivation. To appreciate fully the im- 
portance of an analysis of the development and 
function of such reinforcing stimuli, a word 
must be said of the importance of these stimuli 
in determining human behavior. 

First, a great deal of information is given 
about the behavior of an individual (or, as we 
shall see, a group of individuals) when the na- 
ture of the reinforcers that are effective for the 
individual are described, as well as the rules by 
which these reinforcers are applied. For these 
two types of knowledge will tell us much about 
the types of behavior that the individual will 
develop. 

For example, in our society some of the stim- 
uli that have much reinforcing value (including 
many attitudinal stimuli) are: social and per- 
sonal attention, acclaim, and respect; money, 
fine clothes, expensive cars, and houses; and 
various tokens such as awards and honors. In 
our society, also, there are “rules” (not formal 
or explicit) for the application of these stimuli. 
That is, they are delivered contingent on some 
kinds of behavior but not on others. Thus large 
amounts of these stimuli are delivered contin- 
gent on exceptionally skilled baseball, football, 
acting, dancing, or comic behaviors, among 
others. Relatively small amounts are delivered 
contingent on the behaviors of skilled manual 
work, teaching, unskilled manual work, nurs- 
ing, and many others. 

These characteristics of our reinforcing sys- 
tem and its rules of application, to continue 
with the example, have an effect on the manner 
in which behavior in our society is shaped. 
Consider, thus, a boy who has two classes of 
skilled behaviors—one a set of intellectual skills 
consisting of knowledge and well-developed 
study and scholarly work habits, and the other 
consisting of some form of fine athletic prowess. 
Let us say that either behavior could be devel- 
oped to “championship” caliber. Now, in a 
situation in which the larger amount of rein- 
forcing (attitudinal) stimuli is made contingent 
on the one behavior, this behavior will be 
strengthened, and as must be the case according 
to the principles of instrumental conditioning, 
it will be strengthened at the expense of the 
other behavior. In our society, of course, many 
of the strongest reinforcers are apt to be more 
liberally applied to athletic rather than schol- 
arly behavior. 

When groups are considered, it would also 
be expected that the reinforcer system and its 
rules of application will determine the types of 
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behaviors that are dominant. A society that has 
a differing set of reinforcers and rules will evi- 
dence different behavior over the group of 
people exposed to that set of conditions. A 
society, for example, whose reinforcers are made 
contingent on scholarly research behaviors, will 
to a larger extent than another society create 
stronger behaviors of that type and in a greater 
number of people. In general, all of the differ- 
ent cultural, national, and class behaviors that 
have been observed in sociology, anthropology, 
and other behavioral sciences, can be considered 
to involve this aspect of human motivation— 
the reinforcer system and its rules of applica- 
tion. 

The reinforcer system, including the many 
social attitudes, must also be considered to be 
of great importance in understanding abnormal 
behavior. An individual who has a markedly 
different reinforcer system than those in his 
group is likely to develop abnormal behavior. 
Thus the male for whom other males are strong 
sexual reinforcers is more likely to develop 
homosexual behaviors, for such behaviors will 
be strengthened by contact with those rein- 
forcers. The problem of behavioral treatment 
may thus involve changing the nature of the 
individual's reinforcer (attitudinal) system. 

These, of course, are simplified examples and 
take no account of other variables that will also 
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42. Attitudes Established by Classical Conditioning 


ARTHUR W. STAATS and CAROLYN K. STAATS 


. Osgood and Tannenbaum have stated, “. . . The 
meaning of a concept is its location in a space 
defined by some number of factors or dimen- 
sions, and attitude toward a concept is its pro- 
jection onto one of these dimensions defined 
as ‘evaluative’” (9, p. 42). Thus, attitudes 
evoked by concepts are considered part of the 
total meaning of the concepts. 

A number of psychologists, such as Cofer and 
Foley (1), Mowrer (5), and Osgood (6, 7), to 
mention a few, view meaning as a response— 
an implicit response with cue functions which 
may mediate other responses. A very similar 
analysis has been made of the concept of atti- 
tudes by Doob, who states, “ ‘An attitude is an 
implicit response . . . which is considered 
socially significant in the individual's society’ " 
(2, p. 144). Doob further emphasizes the learned 
character of attitudes and states, “The learning 
process, therefore, is crucial to an understand- 
ing of the behavior of attitudes" (2, p. 138). If 
attitudes are to be considered responses, then 
the learning process should be the same as for 
other responses. As an example, the principles of 
classical conditioning should apply to attitudes. 

'The present authors (12), in three experi- 
ments, recently conditioned the evaluative, po- 
tency, and activity components of word meaning 
found by Osgood and Suci (8) to contiguously 
presented nonsense syllables. The results sup- 
ported the conception that meaning is a response 
and, further, indicated that word meaning is 


* Reprinted from Journal of Abnormal and Social 
Psychology, 1958, 57, 37-40, with permission of the 
senior author and the American Psychological Asso- 
ciation, This article and the theoretical-experi- 
mental method were formulated by A. W. Staats 
as the principal investigator of a rescarch project 
supported by the Office of Naval Research under 
Contract Nonr-2305(00). C. K. Staats aided in the 
conduct of the study and was responsible for the 
method of statistical analysis of the data. 


composed of components which can be sepa- 
rately conditioned. 

The present study extends the original exper- 
iments by studying the formation of attitudes 
(evaluative meaning) to socially significant ver- 
bal stimuli through classical conditioning. The 
socially significant verbal stimuli were national 
names and familiar masculine names. Both of 
these types of stimuli, unlike nonsense syllables, 
would be expected to evoke attitudinal re- 
sponses on the basis of the pre-experimental 
experience of the Ss. Thus, the purpose of the 
present study is to test the hypothesis that atti- 
tudes already elicited by socially significant ver- 
bal stimuli can be changed through classical 
conditioning, using other words as uncondi- 
tioned stimuli. 


METHOD 


Subjects 

Ninety-three students in elementary psychol- 
ogy participated in the experiments as Ss to 
fulfill a course requirement. 


Procedure 

The general procedure employed was the 
same as in the previous study of the authors 
(12). 

Experiment I. 'The procedures were adminis- 
tered to the Ss in groups. There were two 
groups with one half of the Ss in each group. 
Two types of stimuli were used: national names 
which were presented by slide projection on a 
screen (CS words) and words which were pre- 
sented orally by the E(US words), with Ss re- 
quired to repeat the word aloud immediately 
after E had pronounced it. Ostensibly, Ss’ task 
was to separately learn the verbal stimuli simul- 
taneously presented in the two different ways. 

Two tasks were first presented to train the Ss 
in the procedure and to orient them properly 
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for the phase of the experiment where the hy- 
potheses were tested. The first task was to learn 
five visually presented national names, each 
shown four times, in random order. Ss’ learn- 
ing was tested by recall. The second task was to 
learn 33 auditorily presented words. Ss repeated 
each word aloud after E. Ss were tested by pre- 
senting 12 pairs of words. One of each pair was 
a word that had just been presented, and Ss 
were to recognize which one. 

The Ss were then told that the primary pur- 
pose of the experiment was to study “how both 
of these types of learning take place together 
—the effect that one has upon the other, and 
so on.” Six new national names were used for 
visual presentation: German, Swedish, Italian, 
French, Dutch, and Greek served as the CSs. 

These names were presented in random order, 
with exposures of five sec. Approximately one 
sec. after the CS name appeared on the screen, 
E pronounced the US word with which it was 
paired. The intervals between exposures were 
less than one sec. Ss were told they could learn 
the visually presented names by just looking at 
them but that they should simultaneously con- 
centrate on pronouncing the auditorily pre- 
sented words aloud and to themselves, since 
there would be many of these words, each pre- 
sented only once. 

The names were each visually presented 18 
times in random order, though never more than 
twice in succession, so that no systematic asso- 
ciations were formed between them. On each 
presentation, the CS name was paired with a 
different auditorily presented word, i.e., there 
were 18 conditioning trials. CS names were 
never paired with US words more than once so 
that stable associations were not formed be- 
tween them. Thus, 108 different US words were 
used. The CS names, Swedish and Dutch, were 
always paired with US words with evaluative 
meaning. The other four CS names were paired 
with words which had no systematic meaning, 
€g., chair, with, twelve. For Group 1, Dutch 
was paired with different words which had posi- 
tive evaluative meaning, e.g., gift, sacred, happy; 
and Swedish was paired with words which had 
negative evaluative meaning, e.g., bitter, ugly, 
failure For Group 2, the order of Dutch and 


1 The complete list of CS-US word pairs is not 
presented here, but it has been deposited with the 
American Documentation Institute, Order Docu- 
ment No. 5463 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Con- 
gress, Washington 25, D. C., remitting in advance 


Swedish was reversed so that Dutch was paired 
with words with negative evaluative meaning 
and Swedish with positive meaning words. 

When the conditioning phase was completed, 
Ss were told that E first wished to find out how 
many of the visually presented words they re- 
membered. At the same time, they were told, it 
would be necessary to find out how they felt 
about the words since that might have affected 
how the words were learned. Each $ was given 
a small booklet in which there were six pages. 
On each page was printed one of the six names 
and a semantic differential scale. The scale was 
the seven-point scale of Osgood and Suci (8), 
with the continuum from pleasant to unpleas- 
ant. An example is as follows: 


pleasant:—:—:—:—:—:—:—: unpleasant 


The Ss were told how to mark the scale and 
to indicate at the bottom of the page whether 
or not the word was one that had been pre- 
sented. 

The Ss were then tested on the auditorily 
presented words. Finally, they were asked to 
write down anything they had thought about 
the experiment, especially the purpose of it, 
and so on, or anything they had thought of 
during the experiment. It was explained that 
this might have affected the way they had 
learned. 

Experiment II, The procedure was exactly 
repeated with another group of Ss except for 
the CS names. The names used were Harry, 
Tom, Jim, Ralph, Bill, and Bob. Again, half of 
the Ss were in Group 1 and half in Group 2. 
For Group 1, Tom was paired with positive 
evaluative words and Bill with negative words. 
For Group 2 this was reversed, The semantic 
differential booklet was also the same except 
for the CS names. 


Design 

The data for the two experiments were 
treated in the same manner. Three variables 
were involved in the design: conditioned mean- 
ing (pleasant and unpleasant); CS names (Dutch 
and Swedish, or Tom and Bill); and groups (1 
and 2). The scores on the semantic differential 
given to each of the two CS words were ana- 
lyzed in a 2x2 latin square as described by 
Lindquist (4, p. 278) for his Type II design. 
$1.25 for microfilm or $1.25 for photocopies. Make 


checks payable to Chief, Photoduplication Service, 
Library of Congress. 
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RESULTS 


The 17 Ss who indicated they were aware of 
either of the systematic name-word relationships 
were excluded from the analysis. This was done 
to prevent the interpretation that the condi- 
tioning of attitudes depended upon awareness. 
In order to maintain a counterbalanced design 
when these Ss were excluded, four Ss were ran- 
domly eliminated from the analysis. The result- 
ing Ns were as follows: 24 in Experiment I and 
48 in Experiment II. 

Table 1 presents the means and SDs of the 
meaning scores for Experiments I and I. The 


TABLE 1. Means and SDs of Conditioned 
Attitude Scores 


Names 
Experi- Dutch Swedish 
ment Group Mean SD Mean SD 
I 1 2.67 .94 3.42 1.50 
2 2.67 1.31 1.83 90 
Experi- Tom Bill 
ment Group Mean SD Mean SD 


II 1 2.71 2.01 4.12 2.04 
2 3.42 2.55 1.79 1.07 


Note. On the scales, pleasant is 1, unpleasant T 


table itself is a representation of the 2 x 2 de- 
sign for each experiment. The pleasant extreme 
of the evaluative scale was scored 1, the un- 
pleasant 7. 

The analysis of the data for both experiments 
is presented in Table 2. The results of the anal- 
ysis indicate that the conditioning occurred in 
both cases. In Experiment I, the F for the con- 
ditioned attitudes was significant at better than 
the .05 level. In Experiment II, the F for the 


TABLE 2. Summary of the Results of the 
Analysis of Variance for Each Experiment 
———————————————— 


Experiment I Experiment II 


Source d MS F df MS F 
Between Ss 
Groups 1 7.52 4.36* 1 1584 5.00° 
Error 22 1.73 46 317 
Within 
Conditioned 1 752 552° 1 55.51 1047** 
attitude 
Names 1 02 Ol 1 26 05 
Residual 22 136 46 530 
Total 47 95 


conditioned attitudes was significant at better 
than the .01 level. In both experiments the F 
for the groups variable was significant at the 
.05 level. 


DISCUSSION 


It was possible to condition the attitude com- 
ponent of the total meaning responses of US 
words to socially significant verbal stimuli, with- 
out Ss’ awareness. This conception is schema- 
tized in Figure 1, and in so doing, the way the 


cs US R 
Dutch F ==>- 
Eea Aa s r 
mc su —— p 
————— 
77 Pretty = Rp 
Dutch =——— 
`~ Sig T Tpy 


Figure 1. The conditioning of a positive attitude. 
The heaviness of line represents strength of asso- 
ciation. 


conditioning in this study was thought to have 
taken place is shown more specifically. The 
national name Dutch, in this example, is pre- 
sented prior to the word pretty. Pretty elicits a 
meaning response. This is schematized in the 
figure as two component responses; an evalua- 
tive response rpy (in this example, the words 
have a positive value), and the other distinctive 
responses that characterize the meaning of the 
word, Rp. The pairing of Dutch and pretty 
results in associations between Dutch and rpy, 
and Dutch and Ry. In the following presenta- 
tions of Dutch and the words sweet and healthy, 
the association between Dutch and rp, is fur- 
ther strengthened. This is not the case with 
associations Rp, Rg, and Ry since they occur 
only once and are followed by other associa- 
tions which are inhibitory. The direct associa- 
tions indicated in the figure between the name 
and the individual words would also in this 
way be inhibited. i 

It was not thought that a rating response 
was conditioned in this procedure but rather 
an implicit attitudinal response which mediated 
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the behavior of scoring the semantic differen- 
tial scale. It is possible, with this conception, to 
interpret two studies by Razran (10, 11) which 
concern the conditioning of ratings. Razran 
found that ratings of ethnically labeled pictures 
of girls and sociopolitical slogans could be 
changed by showing these stimuli while Ss were 
consuming a free lunch and, in the case of the 
slogans, while the Ss were presented with un- 
pleasant olfactory stimulation. The change in 
ratings could be thought to be due to the con- 
ditioning of an implicit evaluative response, 
an attitude, to the CSs by means of the lunch 
or the unpleasant odors. That is, part of the 
total response elicited by the food, for example, 
was conditioned to the pictures or slogans and 
became the mediation process which in turn 
elicited the positive rating. 

It should be stated that the results of the 
present study do not show directly that Ss’ be- 
havior to the object (e.g., a person of Dutch 
nationality) has been changed. The results per- 
tain to the Ss’ attitudinal response to the signs, 
the national names themselves. However, Ka- 
pustnik (3) has demonstrated that a response 
generalized to an object when the response had 
previously been conditioned to the verbal sign 
of the object. Osgood states: 


The aggressive reactions associated with Nazi. 


and Jap on a verbal level certainly transferred 
to the social objects represented under appro- 
priate conditions. Similarly, prejudicial behav- 
iors established while reading about a member 
of a social class can transfer to the class as a 
whole . . . (7, p. 704). 


The results of this study have special rele- 
vance for an understanding of attitude forma- 
tion and change by means of verbal communi- 
cation. Using a conception of meaning as a 
mediating response, Mowrer (5) has suggested 
that a sentence is a conditioning device and 
that communication takes place when the mean- 
ing response which has been elicited by the 
predicate is conditioned to the subject of the 
sentence. The results of the present study and 
the previous one of the present authors (12) 
substantiate Mowrer's approach by substanti- 
ating the basic theory that word meaning will 


indeed condition to contiguously presented 
verbal stimuli. In the present study, the mean- 
ing component was evaluative, or attitudinal, 
and the CSs were socially significant verbal 
stimuli. The results suggest, therefore, that at- 
titude formation or change through communi- 
cation takes place according to these principles 
of conditioning. As an example, the sentence, 
"Dutch people are honest," would condition the 
positive attitude elicited by “honest” to "Dutch" 
—and presumably to any person called “Dutch.” 
If, in an individual's history, many words elic- 
iting a positive attitude were paired with 
"Dutch," then a very positive attitude toward 
this nationality would arise. 

The reason for the group differences in each 
of the experiments is not clear. These differ- 
ences could have arisen because there were ac- 
tual differences in the Ss composing each group, 
or in some condition of the procedure occurring 
to one of the groups. Nothing the authors were 
aware of seem to indicate this as the explana- 
tion, and in the previous experiments of the 
authors (12) there were no group differences. 
Since in a 2 x 2 latin square the interactions 
are entirely confounded with the main effects, 
the group differences could also have arisen as 
a result of the interaction of the other two 
main effects (i.e., direction of conditioning and 
names). 


SUMMARY 


Two experiments were conducted to test the 
hypothesis that attitude responses elicited by 
a word can be conditioned to a contiguously 
presented socially significant verbal stimulus. A 
name (e.g., Dutch) was presented 18 times, each 
time paired with the auditory presentation of 
a different word. While these words were differ- 
ent, they all had an identical evaluative mean- 
ing component. In Experiment I, one national 
name was paired with positive evaluative mean- 
ing and another was paired with negative eval- 
uative meaning. In Experiment II, familiar 
masculine names were used. In each experiment 
there was significant evidence that meaning re- 
sponses had been conditioned to the names 
without Ss’ awareness, 
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43. A Concept-Formation Approach to Attitude Acquisition 


RAMON J. RHINE 


The purpose of this paper is to attempt a 
theoretical formulation of attitude development 
from a concept-formation point of view. Me- 
diating responses have been employed by Os- 
good (21) to explain concept formation, and a 
mediating or implicit response has also been 
used as a main theoretical construct in Doob’s 
(8) outline of attitude theory. To this extent, 
there is already an established theoretical link 
between concepts and attitudes. 

Doob calls attitude “an implicit drive-produc- 
ing response considered socially significant in 
the individual's society" (8, p. 136). This view- 
point has been criticized by Chein for a number 
of reasons, including the fact that it does not 
clearly indicate the evaluative nature of atti- 
tudes. Chein would rather call attitude a “dis- 
position to evaluate certain objects, actions and 
situations in certain ways" (8, p. 177). Although 
many researchers in the field of attitudes might 
concur with Chein on this last point, a single 
meaning of attitude upon which there is close 
agreement is not available. Nelson's (20) review 
of the literature disclosed 23 more or less dif- 
ferent definitions of attitude, and more recent 
statements could expand the list even further. 
No one definition seems clearly superior solely 
on logical grounds; if it were otherwise, there 
would not be so many alternatives. 

The various definitions name constructs, and 
presumably some constructs have more scien- 
tific value than others. The psychologist, seek- 
ing a guide for his research among many alter- 
natives, looks for a construct with heuristic and 
predictive value, particularly a construct which 
leads to informative research. Insofar as attitude 
change is concerned, Doob's type of approach 
has stimulated considerable research. But in the 


* Reprinted from Psychological Review, 1958, 65, 
362-370, with permission of the author and the 
American Psychological Association. 
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area of attitude development, very little care- 
fully controlled experimentation has been done. 
Besides the research of Eisman (9), there is 
little experimentation available on attitude ac- 
quisition, although studies such as those by 
Horowitz (13) and by Clark and Clark (4) pro- 
vide information about attitude growth. If it 
is possible to explain attitude learning in terms 
of concept formation, then attitude develop- 
ment is opened to controlled experimentation 
through concept-formation methods. A concept- 
formation approach to attitude learning also 
enables any systematic viewpoint which can 
account for concept formation to account at the 
same time for attitude formation. 


THE GENERAL MEANING OF AN ATTITUDE 


A concept is sometimes thought of as a psy- 
chological mechanism that represents a set of 
stimulus patterns; that is, a concept is consid- 
ered a mental principle through which an in- 
dividual can classify a number of objects in his 
stimulus world. Thus, redwood, elm, pine, fir, 
and oak fall into the class of "tall, nonanimal, 
living things" and tree is the name given to 
this principle. Similarly, the concept "a person 
who speaks with a drawl and has dark skin, 
thick lips, and kinky hair" classifies American 
Negroes for some people. Of course, the con- 
cepts learned in real life are not always verbal- 
ized into a precise principle. In fact, behavior 
in laboratory studies (19) indicates that a con 
cept can be learned even though S is unaware 
of it and is unable to verbalize the principle 
on which the concept is based. The same phe- 
nomenon probably occurs with attitudes. People 
often betray by their behavior attitudes which 
they do not recognize as their own. 

On a common-sense level, concepts are devel- 
oped over a period of time through a series of 
experiences. The child learns the concept cow 
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because his parents and others point to objects 
in the field or in pictures and call them by 
that name. He learns that color alone is not 
the key to cow-ness, but that it is helpful. All 
black objects are not called cow, but green 
objects are never called by that name. The 
child soon learns to discriminate between cows 
and other objects as he learns that cow-ness is 
a feature of not one stimulus pattern but of a 
whole class of patterns. 

The child learns about people in much the 
same manner. He discovers that one dimension 
of Negro-ness is dark skin, but that a deep 
suntan does not make one a Negro. He grad- 
ually learns that Negro-ness involves other 
dimensions such as facial characteristics and 
hair properties. This learning eventually ena- 
bles him to make fairly accurate discrimina- 
tions between Negroes and other persons. So 
far, the child has a concept of a Negro but not 
an attitude, because an attitude involves an 
evaluative dimension. Suppose, however, he 
has some unpleasant personal experiences with 
Negroes or receives adverse information about 
them from his parents. Then the child’s con- 
cept might be based on the dimensions, “dark 
skin, kinky hair, thick lips, speaks with a 
drawl, and bad.” Since an evaluative dimension 
now appears in the concept, he has acquired 
an attitude, 

This approach to attitudes is not an arbi- 
trary substitute for other meanings of attitude, 
but is representative of commonly used mean- 
ings, as may be seen from the following 
quotations: 

“Most researchers in attitudes agree that one 
of the functions of attitudes is to integrate 
social perceptions and experiences. Thus our 
experiences with our mothers, sisters, girl 
friends and maiden aunts are integrated . . . 
into an attitude toward women” (16, p. 292). 
". . . attitude [is] a more or less permanently 
enduring state of readiness of mental organi- 
zation which predisposes an individual to react 
in a characteristic way to any object or situa- 
tion with which it is related" (1, p. 13). “The 
attitude is a way of conceiving an object; it is 
the mental counterpart of an object" (10, p. 
11). “. . . the meaning of a concept is its loca- 
tion in a space defined by some number of 
factors or dimensions, and attitude toward a 
concept is its projection onto one of these 
dimensions defined as ‘evaluative’ " (22, p. 42). 
"An attitude, roughly, is a residuum of experi- 
ence, by which further activity is conditioned 


and controlled . . . an inner mental organiza- 
tion takes place which predisposes the person 
to a certain type of activity towards objects, 
persons, and situations" (17, p. 238). 

The thread which runs through these 
quotations—"experiences integrated into an 
attitude,” “mental organization,” “mental 
counterpart,” "a way of conceiving"—is that 
an attitude is a kind of mental organization 
that is built through many experiences. In a 
sense, as a result of varied contacts with a class 
of objects, a concentrated representation of a 
set of stimuli is left within the person, which 
is the essence of what is generally meant by a 
concept. 


AN EXPLANATION OF ATTITUDE 


If an attitude is taken to be a concept with 
an evaluative dimension, its meaning may be 
derived from an explanation of concept for- 
mation. Consequently, any theory which can 
explain concept formation can also help to 
explain attitude development. One of several 
possible explanations of concept formation is 
provided by S-R Theory. 

Borrowing from Osgood (21), a concept may 
be defined as the associations between a com- 
mon response (often verbal) and a set of stim- 
uli. These stimuli frequently comprise a class 
of phenomena which display certain common 
characteristics. The common characteristic of 
the set of phenomena may be the identical 
elements with which Hull (14) worked, such as 
a common shape embedded in each of several 
different Chinese characters. Or, as Smoke (30) 
has demonstrated, it may be common percep- 
tual relations when, for instance, Ss view a 
series of different figures and classify together 
all those having perpendicular lines in them. 
Although the stimuli mentioned in the defi- 
nition of a concept often fall into a class, 
Osgood suggests that the only essential condi- 
tion for concept formation is the associations 
between a common response and a variety of 
stimuli. There are no obvious stimulus char- 
acteristics common to hour-glass, ruler and 
mental test, all of which are measuring instru- 
ments; or to file cabinet, typewriter, and desk, 
all of which are office equipment. These items 
have in common the overt response of naming 
them, and, of greater theoretical importance, 
a common mediating process. 

The mediating response, Hull's pure stimu- 
lus act, can be any fraction of a total response 
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pattern. Partly as a result of the organism's 
capacity to discriminate, the mediating response 
will tend to become as fractionated as possible 
without destroying its cue function. 


The greater the discriminatory capacity of an 
organism, the more reduced and implicit can 
become the "detachable" reactions finally in- 
cluded in the stable mediation process The 
higher the organism in the evolutionary scale, 
the finer the discriminations it can usually 
make and the less gross its representing 
processes. Similarly, the more mature and intel- 
ligent the human individual, the less overt his 
symbolic processes. The hosts of fine discrimina- 
tions that characterize language behavior are 
Nature's farthest step in this direction (21, 
p- 398). 

Of all possible human mediators, probably 
the most significant are verbal mediators. Re- 
search by Cofer and Foley and associates (5, 6, 
7, 12), Riess (28, 29), and others demonstrates 
the significance for language behavior of medi- 
ating responses. Other investigations by Reed 
(23, 24, 25), Kendler and Karasik (15), and by 
Goss and associates (2, 11) show the importance 
of verbal mediators in concept formation. 
Studies by Eisman (9), Rhine (26, 27), and 
Staats and Staats (31) show how verbal medi- 
ators enter into attitudes. 

'The general model for a simple, first-order 
concept based on either verbal or nonverbal 
mediators is illustrated in Part A of Figure 1. 
Many stimuli are conditioned to a common 
mediating response. By virtue of the common 
mediator, these stimuli become organized into 
one system, It is this system of associations be- 
tween the stimuli and the common mediating 
process which is the concept. For the concept 
gray, the stimuli of Part A of Figure 1 would 


Ei 


4 Sn- 
cS, ----- =s ; ^ 
ber 


s > il 
sn- 
S 


(A) (B) 


be various acromatic shades all of which arouse 
the same mediating response. This response 
provides a stimulus generally conditioned in 
adults to the overt, verbal label gray. 

A more complex or second-order concept is 
illustrated in Part B of Figure 1. Each set of 
stimuli with their associated mediators (r,'s) 
are first-order concepts. In the second-order 
concept, the stimuli produced by the first-order 
mediators are themselves associated with an- 
other common mediator. One set of first-order 
stimuli in Part B of Figure 1 might be different 
skin shades leading to the mediator represent- 
ing dark skin; the second set of first-order 
stimuli might be various lip thicknesses which 
are associated with the mediator for thick lips; 
and each of the stimuli from these mediating 
responses may be associated with the mediator 
for Negro (rj. This illustrates a second-order 
concept, but there is no attitude as yet because 
no evaluation is involved. 

There is an attitude when the mediator of 
at least one of the first-order concepts is an 
evaluative reaction. Thus, attitudes are equiva- 
lent to that special class of concepts which is 
distinguished by the inclusion of an evaluative 
dimension. Part C of Figure 1 is a simplified 
illustration of an attitude. Suppose, as in the 
example of a second-order concept, that one r, 
of this figure is the mediator for thick lips, 
that the second r, represents dark skin, and 
that r, is the mediator for Negro. Suppose also 
that the stimuli (S,) of the evaluative dimen- 
sion are dirty, stupid, and rude and that Tı, 
the evaluative reaction, is bad. Now the stimuli 
associated with Negro are dark skin, thick lips, 
and bad. The inclusion of an evaluative dimen- 
sion identifies Part C of Figure 1 as an 
attitude. 

The mediating response produces a stimulus 
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Figure 1. A stimulus-response explanation of concepts and attitude. (A) First-order 
concept; (B) second-order concept; (C) attitude. S — evaluative stimuli; moa 
first-order mediator; r, — a first-order evaluative mediator; r, = a second-order 
mediator; s — the stimulus produced by a mediator. 
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the attitude as shown 


in A of Figure 1), and verbal (r,) and affective (r,) mediators. (B) A stimulus- 
response explanation of prejudice through displaced aggression. 


which may then become conditioned to other 
mediating or instrumental reactions. Part A 
of Figure 2 shows some of the major associ- 
ations possible between the attitude and these 
other mediating responses. It indicates how 
instrumental responses may depend upon a 
chain of associations involving two major types 
of mediators, namely, affective and verbal. 
Part A of Figure 2 is not meant to imply that 
in any given instance all the illustrated associ- 
ations between the attitude and verbal or affec- 
tive mediators will be formed. Instead, it 
indicates that one or more of the associations 
may be learned by a given individual. For in- 
stance, as is shown in Part B-l of Figure 2, 
only some of the relations between the attitude, 
affect, and the instrumental response may be 
learned in an individual case. 

One implication of Figure 2 is particularly 
interesting. A reaction conditioned to the atti- 
tude may be aroused by a stimulus which is 
associated with affect but not with the attitude. 
This situation suggests an explanation for 
prejudiced responses through displaced aggres- 
sion. Suppose an individual has a negative 
attitude towards Negroes. Suppose further that 
the affect of Part B of Figure 2 is the mediator 
for anger and the instrumental response is 
aggression toward Negroes. Assume also, as 
Part B-1 of Figure 2 indicates, that the attitude 
leads to the simultaneous arousal of anger and 
of aggression against Negroes. Eventually, 
through conditioning, anger alone can lead to 
aggression. Now, as indicated by Part B-2 of 
Figure 2, any stimulus (S,) which arouses anger 
will also have a tendency to evoke aggression 
towards Negroes, depending upon the degree 
to which this stimulus is supported by other 
stimuli. As a result, there is a tendency to dis- 
Place aggression from S, the real source of 
anger, to Negroes. 


Other specific predictions follow from the 
general relations shown in Part A of Figure 2. 
The relationship between verbal mediating re- 
sponses and affect suggests one reason why atti- 
tudes are commonly accompanied by affective 
reactions. If a given verbal mediator, through 
past experience, has become associated with 
affect, then the arousal of this mediator by 
an attitude will also lead to an affective 
response: the attitude leads to the verbal medi- 
ator and the verbal mediator is associated with 
affect. Suppose, for instance, that an attitude 
supplies the conditioned stimulus which evokes 
the verbal mediator for green vomit. Then, in- 
sofar as this mediator evokes an emotional 
reaction, the attitude will be accompanied by 
affect. This particular verbal mediator was used 
as an example because in some cases it will 
make the illustration more vivid by actually 
arousing an emotional reaction in the reader. 

There is probably a more important reason 
why attitudes are commonly linked to affect. 
It is quite often true that the experiences 
through which the attitude is developed occur 
in a charged atmosphere. In this case, the medi- 
ating response of the attitude is continuously 
aroused in conjunction with an emotional 
reaction, and the stimulus produced by the 
mediating response becomes a conditioned 
stimulus capable of evoking effect. As a result, 
the expression of this attitude is accompanied 
by an emotional response. 


A METHOD FOR STUDYING ATTITUDE 
DEVELOPMENT 


One of the main values of the concept- 
formation approach to attitude is that it sug- 
gests a simple laboratory method for studying 
attitude development. If an attitude is a kind 
of concept, then its acquisition may be inves- 
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tigated by the methods of concept formation. 
Such a method, depending on an evaluative 
dimension in the concept, has been employed 
by the writer. In order to construct an evalu- 
ative dimension, 100 Ss were asked to rate a 
large number of descriptive traits in terms of 
their desirability in a person. Some example 
traits are: ambitious, boring, calm, intelligent, 
obedient, and vain. All traits showing less than 
90% agreement as to their desirability were 
discarded. The remaining traits can be em- 
ployed to study the development of an attitude 
and to measure its strength and resistance to 
change. To do this, a group of Ss is told that 
anthropologists, missionaries and physicians 
have visited a group of people about which 
little is known, and that all the visitors agree 
on certain characteristics of this group of 
people. When these instructions are concluded, 
Ss are shown a number of traits, one by one, 
and are asked to predict whether each trait is 
or is not characteristic of the people. After each 
prediction they are told whether or not the 
word is indeed characteristic. For example, to 
develop a positive attitude, E would say “char- 
acteristic” after presenting each trait previously 
rated as desirable and "not characteristic" for 
words previously rated undesirable. 

If S has learned the positive attitude, which 
might be labeled "pro-ness," he would be ex- 
pected to respond accordingly. Hence, when he 
has completed the first list of traits, S is asked 
to continue responding in the previous manner 
to a second list of words without being told 
which traits are characteristic or not character- 
istic. Responses under this condition are taken 
as a measure of attitude strength. It is assumed 
that a strongly acquired, positive attitude will 
lead to a high proportion of positively rated 
traits being called "characteristic" and to most 
negatively rated traits being called not "charac- 
teristic,” 

Finally, resistance to attitude change is mea- 
sured by using a third list of traits with E again 
saying “characteristic” or “not characteristic” 
after each prediction. But this time E's re- 
Sponses are reversed. For example, if $ has pre- 
viously acquired a positive attitude, change 
may be accomplished by calling "not charac- 
teristic” all positive traits in the third list and 
"characteristic" all negative traits. 

This method is derived from the concept- 
formation approach to attitudes. The meaning 
of the many traits seen in a single session varies 
considerably, as can be observed from the ex- 


amples given above. Although the traits are 
not synonyms, they do exhibit one common 
aspect, the evaluative element. The Ss are able 
to respond correctly when this evaluative ele- 
ment is consistently reinforced by E because 
the traits are stimuli which are capable of 
arousing a common, evaluative mediating re- 
sponse. The mediating process, in turn, deter- 
mines S's further predictions of "characteristic" 
or “not characteristic.” 

The stimulus function of mediating re- 
sponses in attitude development is demon- 
strated by two experiments which also illustrate 
the mediating response approach as applied 
through the conceptattitude method. A nega- 
tive attitude was developed in both investiga- 
tions. In one study (27) consistency of 
reinforcement was defined as the sum of the 
number of positive traits called "not charac- 
teristic’ by E and the number of negative traits 
called "characteristic," divided by the total 
number of traits used during attitude develop- 
ment. Using five levels of consistency, it was 
shown that both attitude development and 
strength increase as a function of consistency 
of reinforcement, as would be predicted from 
the approach to attitude development based on 
mediating responses. The results for attitude 
change also suggest that attitudes exhibit prop- 
erties of mediating responses. Research on 
acquired distinctiveness of cues (18) indicates 
that strong habits reverse more easily than 
weak ones when the cues arousing them have 
become distinct through learning. The distinc- 
tiveness of the stimulus dimension is believed 
to be related to the degree to which the stimuli 
are conditioned to a mediating process. The 
evaluative element of the stimulus traits should 
be most distinct when reinforcement is fully 
consistent; consequently, attitude — change 
should be easiest under this condition. The Ss 
receiving the greatest consistency of reinforce- 
ment did in fact show least resistance to change. 

A second experiment (26) illustrating the 
conceptattitude method investigated the effect 
on attitude development of peer responses. 
The traits were used with 10095 and 84% con- 
sistency of reinforcement to develop a negative 
attitude. Some Ss made their predictions after 
first hearing the responses of three fellow stu- 
dents who were E's confederates, and other Ss 
made their predictions in private. The con- 
federates’ responses prior to S's predictions 
were always or almost always in contradiction 
with the statement E made after each predic- 
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tion. Thus, when the confederates said a trait 
was characteristic, it was pretty certain that E 
would say the trait was not characteristic. The 
Ss who heard the confederates’ responses 
learned the attitude with greater ease than 
those who responded in private. Apparently, 
peer responses provided cues which were an 
aid to S in attaining the attitude. This seemed 
to occur mainly with high consistency of rein- 
forcement. With less than full consistency of 
reinforcement, peer responses were predictive 
of E's behavior for a majority of the traits, 
but not all of them. When this ambiguity is 
introduced, cues from peer responses do not 
affect attitude development at a statistically 
significant level, although there is a slight trend 
in this direction. 

These results may be explained by consider- 
ing the effect of peer responses upon the 
arousal of the common, evaluative mediating 
response. With 100% consistency of reinforce- 
ment, a negative mediating reaction is aroused 
by a trait just prior to reinforcement and in 
conjunction with peer responses of “character- 
istic.” Similarly, a positive mediating reaction 
is aroused in conjunction with peer responses 
of “not characteristic.” In time, as a result of 
conditioning, the peer responses can be ex- 
pected to help cue the appropriate positive or 
negative mediating response. With 84%, consis- 
tency of reinforcement, the trend of condition- 
ing is the same, but a minority of trials tend 
to establish competing, mediating responses: 
the same peer reactions are sometimes associ- 
ated with positive mediating responses and 
sometimes with negative. Under these condi- 
tions, peer responses are a less stable and less 
predictive aspect of the stimulus environment. 
This would account for the statistically signifi- 
cant effect from peer responses with high con- 
sistency of reinforcement, which is diminished 
with lesser consistency. 
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44. A Behavior Theory Approach to the Relations between 
Beliefs about an Object and the Attitude Toward the 


Object 


MARTIN FISHBEIN 


In another paper in this book, Fishbein (see 
pp- 257 to 266) presented a distinction between 
beliefs about an object and the attitude toward 
that object. (Generally speaking, attitudes were 
conceptualized as learned predispositions to 
respond to an object or class of objects in a 
consistently favorable or unfavorable way, and 
beliefs about an object were viewed as hy- 
potheses concerning the nature of the object 
and its relations to other objects) Consistent 
with the works of Doob (1947), Lott (1955), 
Osgood et al. (1957), Rhine (1958), and others, 
both of these phenomena (i.e. beliefs about 
an object and attitude) may readily be placed 
within the framework of behavior theory. As 
Lott has pointed out, the placement of attitude 
within the framework of behavior theory "ap- 
pears to be a more research-oriented and poten- 
tially fruitful approach to the problem." (p. 
321). Indeed, by following the principles of 
behavior theory, a model of attitude acquisition 
and a model of the relationships between be- 
liefs about an object and the attitude toward 
the object can be generated. Before turning to 
these models, however, let us briefly consider 
the behavior theory view of attitudes and be- 
liefs about an object. 


ATTITUDE 


Consistent with the work of Osgood and his 
associates o" 1965), an attitude may be char- 
acterized as a “mediating evaluative response,” 
that is, as a learned implicit response that 
varies in intensity and tends to “mediate” or 
guide an individual's more overt evaluative 


* Specially prepared for this book. 


responses to an object or concept. Two points 
about this view should be noted. First, attitude 
is treated as a unidimensional concept; it refers 
* only to the “evaluation” of a concept (i.e., its 
i: ness" or "badness"). In this respect, it is 
entirely consistent with Thurstone’s (1931) defi- 
nition of attitude as "the amount of affect for 
or against a psychological object." Second, as 
Osgood et al. have pointed out, "every point 
in semantic space has an evaluative component 
(even though the component may be of zero 
magnitude when the evaluative judgments are 
neutral)" Thus, with respect to any object, 
an individual has a positive, negative, or neu- 
tral attitude; that is, there is a mediating eval- 
uative response associated with every stimulus. 


BELIEF ABOUT AN OBJECT 


As Fishbein (see pp. 257 to 266 in this book) 
has pointed out, any belief about an object 
can be defined in terms of the "probability" 
or "improbability" that a particular relation- 
ship exists between the object of belief (e.g., 
an attitude object) and any other object, con- 
cept, value, or goal. If the object of belief (i.e., 
the attitude object) is viewed as a "stimulus" 
and if the object or concept related to the ob- 
ject of belief is viewed as a “response,” a belief 
statement may be viewed as a stimulus-response 
association. Thus a belief about an object may 
be seen as being highly related to the proba- 
bility that the stimulus elicits the response, 
that is, to the probability that there is an asso- 
ciation between the stimulus (the attitude ob- 
ject) and the response (any other concept). 
Furthermore, it should be noted that an indi- 
vidual has many beliefs about any aspect of 
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his world. That is, an individual associates 
many different concepts with any given atti- 
tude object. The totality of an individual's 
beliefs about an object can thus be viewed as 
a belief system. In addition, this system of re- 
sponses associated with a given stimulus may 
also be viewed as a habit-family-hierarchy of 
responses (Hull, 1943). The higher the response 
in the hierarchy, the greater is the probability 
that the response is associated with the stimu- 
lus, that is, the stronger is the belief. Empirical 
evidence supporting this conception of a belief 
system has been presented by Fishbein (1963). 

To summarize, both beliefs about an object 
and the attitude toward the object have been 
placed within a behavior theory framework. 
Attitudes have been viewed as learned, mediat- 
ing evaluative responses, and beliefs about an 
object have been viewed in terms of the proba- 
bility (or strength) of stimulus-response associ- 
ations. Furthermore, a belief system has been 
conceptualized as a habit-family-hierarchy of 
responses. Given these descriptions, a model of 
attitude acquisition and a model of the rela- 
tionships between beliefs and attitudes can be 
developed by following the principles of be- 
havior theory and, in particular, the principle 
of mediated (ie. secondary or conditioned) 
generalization (see Birge, 1941; Cofer and Foley, 
1942; Murdock, 1952; Mednick, 1957). 


4 


1. A MODEL OF ATTITUDE ACQUISITION 


In the foregoing article in this book (pp. 
382 to 388), Rhine, working from Osgood’s 
definition of attitude (i.e, “the evaluative di- 
mension of a concept”), develops a model of 
attitude acquisition based on a consideration 
of the process of concept formation. Rhine's 
model may be seen in Figure 1. To review 
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briefly, Rhine argues that no attitude is present 
in either A (first-order concept) or B (second- 
order concept) of Figure 1. However, “there is 
an attitude when the mediator of at least one 
of the first-order concepts is an evaluative one.” 
Thus, as Rhine views it, we first acquire the 
first-order mediators (or concepts) “thick lips” 
and “dark skin" from sets of first-order stimuli 
of lip thicknesses and skin shades. When the 
stimuli produced by these mediators come to 
elicit the second-order mediator "Negro," we 
have a second-order concept, but not an atti- 
tude (see Figure 1B). It is only when a series 
of first-order “evaluative stimuli" elicit a first- 
order evaluative mediator (eg. "bad"), and 
the stimulus produced by this evaluative medi- 
ator also comes to elicit the second-order 
mediator “Negro,” that an attitude is present 
(see Fig. 1C). Thus, according to this con- 
ceptualization, we may acquire the concept 
“Negro” and not have an attitude toward 
“Negro.” The concept and the attitude can 
be learned independently or simultaneously. 
Similar views of the relationship between con- 
cept learning and attitudes have been presented 
by Clark and Clark (1958) and Allport (1954). 

However, as was pointed out in the discussion 
of attitude, all concepts contain an evaluative 
component. That is, there is an attitude, an 
evaluative response associated with all concepts. 
Furthermore, it should be noted that as Osgood 
used the term “concept,” it referred to any 
discriminable aspect of the individual's world; 
it might refer to any object, person, word, 
groups of words, and so on. Thus, as this term 
is used by Osgood, all "stimuli" and all “re- 
sponses" (verbalizable or not) are viewed as 
concepts. 

In A of Figure 1, then, all the “first-order 
stimuli” of various skin shades eliciting the 
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Figure 1. Ramon J. Rhine's theory of attitude acquisition (from Rhine, 1958). (A) 

First-order concept; (B) second-order concept; (C) attitude. § — a first-order 

Stimulus; S, = an evaluative stimulus; r, = a first-order mediator; tr, a first- 
e 

order evaluative mediator; r, — a second-order mediator; s — the stimulus pro- 


duced by a mediator. 
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Figure 2. A general model of attitude acquisition. (A) First-order concept with 
attitude; (B) second-order concept with attitude. S — a stimulus; r; = @ first-order 
mediator; LES first-order evaluative mediator; r, — a second-order mediator; 
Ty, = a second-order evaluative mediator; s = the stimulus produced by a mediator. 
The higher the stimulus in the stimulus hierarchy, the greater the probability 
that it will elicit the concept (either first-, second-, or n-order). The evaluative 
mediators represent a summation of the evaluative responses elicited by the 
stimuli. The strength of the evaluative mediator, ie. the attitude, is a function 
of the evaluative dimension of each of the stimuli and the probability that the 


stimuli elicit the concept. 


mediator or concept “dark skin” also elicit an 
evaluative response, These evaluative responses 
are seen as being summative; through the medi- 
ation process, the summated evaluative response 
is associated with the mediator or concept “dark 
skin.” Thus, on future occasions, “dark skin” will 
elicit an evaluative response that is a function 
of the stimuli that elicit it (ie. dark skin). 
This may be seen in A of Figure 2. Similarly, 
the stimuli produced by other first-order con- 
cepts or mediators (e.g., thick lips, curly hair) 
will elicit evaluative responses at the same time 
that they elicit the “second-order mediator” or 
concept “Negro.” Therefore, on future occa- 
sions, the concept or stimulus “Negro” will 
elicit an evaluative response, an attitude, that 
is a function of the stimuli that elicit it (i.e., 
Negro). This may be seen in B of Figure 2. 

The probability that the stimuli (e.g, dark 
skin, thick lips) elicit the mediator or concept 
“Negro,” also influence the evaluation of (the 
attitude toward) “Negro.” If a stimulus that 
elicits a strong evaluative response only elicits 
“Negro” 40 per cent of the time, while another 
stimulus that also elicits a strong evaluative 
response elicits “Negro” 80 per cent of the time, 
the latter will contribute more to the evalua- 
tion of “Negro” than will the former. Thus 
attitude toward any concept is a function of 
the evaluations of the stimuli that elicit the 
concept and the probability that these stimuli 
will elicit the concept. 

This conception of attitude acquisition dif- 


fers from Rhine's in that, following Osgood 
et al. (1957), all stimuli have evaluative re- 
sponses associated with them. Furthermore, 
every time a new concept is learned, an attitude 
is automatically acquired with it. Attitude 
acquisition is an automatic, nonverbalized 
process that occurs in conjunction with con- 
cept learning. 


2. A MODEL OF THE RELATIONSHIPS BE- 
TWEEN BELIEFS ABOUT AN OBJECT AND 
THE ATTITUDE TOWARD THE OBJECT 


The theory of concept formation and attitude 
acquisition carries important implications for 
the study of beliefs about an object and the 
attitude toward it. If concepts are learned as 
the theory suggests, it follows that an indi- 
vidual's beliefs about the concept should be 
some function of the learning process. In the 
example above, we can assume that at least 
two of the individual’s beliefs about “Negroes” 
are that (1) Negroes have dark skin, and (2) 
Negroes have thick lips. Furthermore, if “dark 
skin" has elicited "Negro" more frequently 
than "thick lips" has, his belief that Negroes 
have dark skin should be stronger than his 
belief that Negroes have thick lips. If, for the 
purpose of explication, we consider the process 
of concept formation as input, and beliefs 
about the object as output, then (even grant- 
ing the asymmetry of backward and forward 
association) immediately after concept forma- 
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tion the output should be highly correlated 
with or equal to the input. This may be seen 
im Figure 3, which presents the theoretical 
model of the output side of the relationship, 
the model of the relations between beliefs 
about an object and the attitude toward that 


object. 


ative mediator; s — the stimulus produced by a 


In Figure 34, it can be seen that immediately 
after concept formation, the concept (or stimu- 
lus) elicits the set of responses that have served 
to define it (eg, "dark skin," "thick lips," 
etc). Each of these mediating responses, how- 
ever, also serve as stimuli. Viewing these re- 
sponses in this way (i.e, as stimuli), it can be 
seen that the right-hand side of Figure 34 is 
identical to the model of attitude acquisition 
presented in Figure 2B. That is, “dark skin,” 
“thick lips,” etc, will each tend to elicit the 
concept “Negro” as well as a positive, negative, 


or neutral evaluative response. As was discussed 
above, these evaluative responses summate 
through the mediation process, the summated 
evaluative response becomes associated with 
the concept “Negro.” 

In addition, the summated evaluative re 
sponse (ie, the attitude) also becomes associ 
ated with the concept (or stimulus) through the 
process of classical conditioning. This can be 
seen in Figure $B, where, for the purpose of 
presentation, only one belief is considered. 
Following the classical conditioning paradigm, 
it can be seen that, to a certain degree, the 
evaluative response. (i.e, the UCR—uncondi- 
tioned response) elicited by "dark skin" (ie. 
the UCS—unconditioned stimulus) becomes 
associated with the concept "Negro" (ie., the 
CS—conditioned stimulus). The stronger the 
association between "Negro" and "dark skin" 
(i.e., the stronger the belief that “Negroes have 
dark skin"), the more the evaluation of "dark 
skin" will become associated with "Negro." 
However, as already mentioned, an individual 
has many beliefs about any given concept, and 
the evaluation associated with each of these 
beliefs will also become associated, in part, 
with the attitude object or concept. This can 
be seen in Figure 3C. Again, it should be noted 
that the evaluative responses associated with 
each of the beliefs are viewed as summative; 
thus it is this summated evaluative response, 
i.e., this attitude, that becomes associated with 
the concept (e.g, Negro) Furthermore, the 
amount of the evaluative response associated 
with each belief that is available for summa- 
tion is a function of the strength of the belief. 
That is, if "Negro" elicits "dark skin" 95 per 
cent of the time and "thick lips" 60 per cent 
of the time, the evaluation of "dark skin" will 
contribute more to the evaluation of "Negro" 
than will the evaluation of "thick lips." 

An individual's attitude toward any object, 
then, is learned as a result of both mediation 
and conditioning. The complete model of the 
relations between beliefs about an object and 
the attitude toward that object can be seen in 
Figure 4. It should again be noted that the 
model of attitude acquisition is included within 
the model of the relations between beliefs and 
attitude. Thus it becomes apparent that beliefs 
about an object and the attitude toward that 
object are in a continuous, dynamic relation- 
ship. Changes in any one part of the system 
may produce changes in all the other parts. 
In addition, in Figure 4, it can be seen that 
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Figure 4. A model of the relation between belief 
about and attitude toward an object. S — the 
stimulus, i.e, attitude object; i= first-order media- 
tors elicited by S, indicating belief r, about S; 
= an evaluative mediator; r, — a second-order 
mediator. 


the summated evaluative response (ie, the 
attitude) that is learned in concept formation 
is identical (for all practical purposes) to the 
summated evaluative response acquired through 
conditioning. However, it should be recalled 
that the discussion above was only concerned 
with the beliefs an individual held immedi- 
ately following concept attainment. Once a 
concept has been learned (or once a given stim- 
ulus has been labeled), new beliefs are acquired 
and some of the original beliefs may be weak- 
ened or strengthened. That is, "new" concepts 
or “responses” become associated with the atti- 
tude object, and many of the original S-r asso- 
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(A) 


ciations may be positively or negatively 
reinforced. Each of these changes in belief will 
affect the evaluation of the attitude object (i.e., 
the stimulus concept). This may be seen in 
Figure 5. 

In A of Figure 5, it can be seen that, follow- 
ing concept formation, the presentation of a 
"stimulus" (eg, Negro—Sy) will elicit a 
learned, mediating evaluative response (i.e, an 
attitude—r,) and a response representing the 
stimulus (i.e., ry). That is, the subject tends to 
read, or to repeat to himself, the stimulus 
toward which he is attending; he makes a “la- 
beling" response (Hovland, Janis, and Kelly, 
1953). Furthermore, it should be recalled that 
the learned mediating evaluative response is 
a function of the individual's initial beliefs 
about the attitude object. Figure 5 is identical 
to Figure 4, except that the mediating beliefs 
(e.g., "dark skin," "thick lips," etc.) have been 


omitted for purposes of presentation. 


Once the individual has learned the concept, 
however, he may learn new associations to it. 
For example, he may now learn that "Negroes 
are athletic." This new response (i.e. athletic 
—r,) becomes a part of the individual's habit- 
family-hierarchy of responses to the stimulus 
"Negro" (Sy). Similar to the other responses 
in the hierarchy, the response "athletic" may 
also be viewed as a stimulus that itself elicits a 
learned mediating evaluative response. This 
evaluative response elicited by "athletic" will 
summate with the evaluative response elicited 
by the attitude object (i.e, Negro), which, it 
will be recalled, is itself a summated response 
based on all the other beliefs in the individual's 
hierarchy. Through the processes of condition- 
ing and mediated generalization, this "new" 
summated evaluative response becomes associ- 
ated with the stimulus concept (ie. Negro); 
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Figure 5. A model of attitude change. (A) Before learning new information; (B) 
after learning new information. Sy = the stimulus (e.g., Negro); r, — an evalua- 
tive response; ry, — a mediating response representing the stimulus; r, — a media- 
ting response the newly learned information (e.g., Negroes are 
athletic); s — the stimulus produced by a mediator. 
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thus, on future occasions, the attitude object 
(ie. Negro) will elicit this "new" summated 
mediating evaluative response (ie this atti- 
tude). This may be seen in B of Figure 5. It 
should again be recalled that the stronger the 
association between “Negro” and "athletic" 
(ie, the stronger the Sy-r, association or the 
stronger the belief that “Negroes are athletic"), 
the more the evaluation of "athletic" will con- 
tribute to the evaluation of "Negro." Similarly, 
any change in the strength of previously held 
beliefs (i.e, weakening or strengthening any of 
the S-r associations) will also change the evalu- 
ative influence of that belief on the final eval- 
uation of the stimulus concept (Sy). 

Because most learning probably occurs after 
the concept is learned (or the stimulus is 
"labeled"), attitudes can best be viewed as be- 
ing functions of the individual's beliefs about 
the attitude object. Indeed, it is possible that 
many of the stimuli that originally influenced 
concept formation do not remain in the indi- 
vidual's belief system. That is, they may drop 
out of the response hierarchy completely either 
through replacement by new beliefs or through 
negative reinforcement. 

To summarize, then, attitudes are most 
likely learned initially as part of the process 
of concept formation. Once the concept has 
been learned, however, the individual learns 
many new things about it, that is, he associates 
many different objects, concepts, values, or 
goals with the attitude object (the stimulus 
concept). This set of responses associated with 
the concept may be viewed as a belief system 
—a habit-family-hierarchy of responses. The 
higher the response in the hierarchy, the 
greater the probability that the response is asso- 
ciated with the stimulus concept, that is, the 
Stronger the belief. Each of these associated 
responses may also be viewed as stimuli, which 
themselves elicit a learned mediating evaluative 
response, These mediating evaluative responses 
are viewed as summative; through the processes 
of mediated generalization and conditioning, 
this summated evaluative response becomes 
associated with the stimulus concept. Thus, 
when the concept is presented, it will elicit 
this summated evaluative response, that is, it 
will elicit this learned attitude. Finally, it 
should be noted that the higher the response 
in the hierarchy (i.e. the stronger the belief), 
the greater will be the amount of its evaluative 
response that is available for summation. 

Thus, in its simplest form, the theoretical 


model (see Figure 4) leads to the prediction 
that an individual's attitude toward any object 
is a function of (1) the strength of his beliefs 
about the object (i.e. those beliefs in his re- 
sponse hierarchy) and (2) what Fishbein (1963) 
has called the evaluative aspect of those beliefs 
(i.e., the evaluation of the associated responses). 
Algebraically, this may be expressed as follows: 


N 
LÁ > Bea; 
1 


where A, = the attitude toward object o 

B, — the strength of belief i about o, that 
is, the "probability" or "improba- 
bility" that o is associated with 
some other concept x; 

4, — the evaluative aspect of B, that is, 
the evaluation of x; 

N — the number of beliefs about o, that 
is, the number of responses in the 
individual's habit-family-hierarchy 


Before turning to a consideration of some of 
the evidence supporting this hypothesis, sev- 
eral points should be made: 


l. It should be noted that this prediction 
is similar to predictions made by other inves- 
tigators (e.g, Smith, 1949; Cartwright, 1949; 
Rosenberg, 1956, 1960; Zajonc, 1954; Peak, 
1955). For example, Rosenberg, working from 
the point of view of a consistency principle, 
has predicted that the affect attached to an 
attitude object will be highly related to (a) 
"the perceived instrumentality of the attitude 
object, that is, the judged probability that 
the attitude object would lead to, or block, the 
attainment of "valued states,” and (b) the 
"value importance," that is, the intensity of 
affect expected from these "valued states." 
Similarly Zajonc, working within the frame- 
work of a theory of "cognitive set," has pre- 
dicted that the valence of (i.e., attitude toward) 


1 Algebraically, the central equation of Rosen- 
berg's theory may be expressed as follows: 


where 4, = the attitude toward the object, 1; = 
the belief or probability that the object will lead 
to or block the attainment of a given valued state 
"i," V, the "value importance" or the amount of 
affect expected from valued state “i,” and N = the 
number of beliefs. 
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any object is a function of (a) the valence of 
its characteristics, and (b) the “prominence” 
of these characteristics, where “prominence” 
refers to “the ability of the characteristic to 
represent the object,” that is, the belief that 
the characteristic is indeed a defining attribute 
of the object.? Although there are several theo- 
retical and methodological differences between 
the various theories that have dealt with the 
belief-attitude relationship (e.g., see Fishbein, 
1961), the important point is that all of them 
essentially lead to the hypothesis that an indi- 
vidual's attitude toward any object is a func- 
tion of his beliefs about the object and the 
evaluative aspects of those beliefs. 

2. It should also be noted that this hypothesis 
is entirely consistent with the way in which 
most standardized attitude measurement instru- 
ments obtain their estimates of attitude. That 
is, as Fishbein (see pp. 257 to 266 in this book) 
has pointed out, most of the standard attitude 
measurement instruments (e.g., Thurstone 
Scales, Likert Scales, Guttman Scales, etc.) ob- 
tain their estimates of attitude through a con- 
sideration of a set of the respondent's beliefs 
about the attitude object and the evaluative 
aspects of those beliefs. Thus, in a sense, the 
algebraic formula presented above may be 
viewed as a general formula for obtaining esti- 
mates of attitude. In Figure 3D, however, it 
can be seen that the only beliefs that serve as 
determinants of an individual’s attitude are 
those that are present in his habit-family- 
hierarchy of responses. That is, although all of 
an individual's beliefs about an object serve 
as indicants of his attitude toward the object, 
it is only the individual's salient beliefs, i.e., 
those in his hierarchy, that serve as determi- 
nants of attitude. Although a complete discus- 
sion of the distinction between determinants 
and indicants of attitude is beyond the scope 
of the present paper, this distinction does sug- 
Best that the best estimates of attitude will be 


? Algebraically, the central equation of Zajonc's 
theory may be expressed as follows: 


N 


o Prom(a) 


1 
4, = Va(o) = —— — 

(N2 + N)/2 
where A o = attitude toward the object, Va(o) = 
the valence of the object, Va(a) = the valence of 
the characteristic “a,” Prom(a) = the prominence 
of characteristic “a,” and N = the number of char- 
acteristics. 


obtained when the estimate is based solely on 
a consideration of an individual's salient be- 
liefs. Support for this hypothesis may be found 
in Rosenberg's contribution to this book (see 
pp. 325 to 331). He found that estimates of 
attitude based on a consideration of an indi- 
vidual's salient beliefs (i.e., those elicited by the 
subject) were considerably more accurate than 
estimates based on a consideration of 35 be- 
liefs selected on an a priori basis. 

It should be recalled that most attitude mea- 
surement instruments consist of a series of 
belief statements selected on some a priori 
grounds. Thus most of the beliefs that they 
contain are probably not salient for the re- 
spondents. Although these instruments will 
still provide valid estimates (or measures) of 
attitude, there is undoubtedly some loss in the 
precision of the estimate. Clearly, the greater 
the proportion and absolute number of salient 
beliefs contained in the instrument, the smaller 
will be the loss in validity. 

This, however, raises a question about the 
number of beliefs that can be salient for an 
individual. That is, how many “objects, con- 
cepts, values, or goals" can an individual asso- 
ciate with an attitude object at any point in 
time? Studies on the span of attention or appre- 
hension suggest that, in general, an individual 
can only perceive, and attend to, six to eleven 
objects at the same time (Woodworth and 
Schlossberg, 1954; Miller, 1956). Even though 
groupings do increase the number of objects 
that can be perceived, it seems likely that only 
six to eleven beliefs are salient, that is, are in 
the individual's hierarchy, at any one time. 
"That is, although an individual may have many 
beliefs about any given attitude object, there 
are probably only six to eleven beliefs that 
actually appear in his hierarchy (i.e. that are 
above some response threshold) and function 
as determinants of attitude. Somewhat the same 
kind of notion has been suggested by March 
and Simon (1958) in their distinction between 
"satisficing" and "optimizing." According to 
these investigators: 


Most human decision-making . . . is concerned 
with the discovery and selection of satisfactory 
alternatives; only in exceptional cases is it con- 
cerned with the discovery and selection of op- 
timal alternatives. (p. 141) 


From the point of view presented here, "satis- 
ficing" involves only the six to eleven imme- 
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diately salient beliefs, whereas "optimizing" 
takes into account many more beliefs. 

A recent study by Kaplan (1966) provides 
some support for the hypothesis that only six 
to eleven beliefs function as the primary deter- 
minants of an individual's attitude. Kaplan 
asked his subjects to list as many "character- 
istics, qualities, and attributes" of “Negroes” 
as they could. The number of beliefs presented 
by subjects varied from 3 to 25. Two estimates 
of attitude were computed for each subject— 
one based on a consideration of all his re- 
sponses, and one based on a consideration of 
only “salient” beliefs, with saliency being oper- 
ationally defined in terms of position in the 
response hierarchy. That is, Kaplan assumed 
that, at most, a subject might have nine salient 
beliefs. Thus, even though a subject might have 
listed more than nine beliefs, only the first 
nine that he listed were taken into account in 
this latter estimate of his attitude. Consistent 
with expectations, the estimates of attitude 
based only on the “salient” beliefs were more 
accurate predictors of attitude than estimates 
based on the total set of beliefs an individual 
listed. 

Although the findings of Rosenberg and 
Kaplan support the hypothesis that an indi- 
vidual's attitude toward any object is primarily 
determined by his salient beliefs, and that 
there are probably only six to eleven beliefs 
that are salient, it must be made clear that 
these findings do not imply that valid estimates 
of attitude cannot be obtained from nonsalient 
beliefs. Indeed, both of these investigators did 
obtain valid estimates of attitude from consid- 
erations of nonsalient beliefs. However, the 
Most precise estimates of attitude were those 
based solely on salient beliefs and their evalu- 
ative aspects. Furthermore, because only salient 
beliefs function as determinants of attitude, it 
will be only through a consideration of these 
beliefs that one will be able to gain an under- 
standing of the genesis of attitude. 

3. A final question concerns the types of be- 
liefs that are related to and/or function as 
determinants of attitude. A review of the litera- 
ture on attitude organization and change indi- 
cates considerable controversy about this 
question. For example, many investigators (e.g. 
Krech and Crutchfield, 1948, Katz and Stotland, 
1959; Abelson and Rosenberg, 1958) have at- 
tempted to distinguish between beliefs that are 
attitudinal in nature (i.e., those that contain an 
implicit or explicit evaluation of the attitude 


object) and beliefs that are unrelated to atti- 
tude (i.e., so-called “descriptive or reportorial" 
beliefs). Indeed, with very few exceptions (e.g., 
Campbell, 1950; Zajonc, 1954; Fishbein, 1963), 
investigators have tended to ignore these “de- 
Scriptive or reportorial" beliefs in their in- 
vestigations of attitude. Furthermore, due to 
the various techniques of item selection that are 
used, this type of belief is seldom, if ever, found 
as an item on one of the standard attitude 
measuring instruments. From the point of 
view of theory presented here, however, any 
type of belief (e.g. descriptive, reportorial, 
instrumental, etc.) may serve as a determinant 
or indicant of an individual's attitude. That is, 
any belief that is present in the individual's 
habitfamily-hierarchy has an evaluative medi- 
ating response associated with it (i.c., all beliefs 
have evaluative aspects), and thus it will con- 
tribute to the individual's attitude. In addition, 
it seems reasonable to assume that the strongest 
beliefs about an object that an individual holds 
are those beliefs that serve to define and de- 
scribe the object for him, that is, descriptive 
beliefs. Because these beliefs are likely to be 
high in the individual's hierarchy, they prob- 
ably serve as some of the most important de- 
terminants of attitude. Thus, rather than ignore 
"descriptive or reportorial" beliefs, the present 
approach suggests that increased attention 
should be paid to them in future studies of 
attitude organization and change. 

Because most investigators have tended to 
ignore "descriptive beliefs" it was felt that 
this type of belief would provide the most 
severe test of the hypothesis. Thus, in the 
initial test of the theory presented above, an 
attempt was made to predict subjects' attitudes 
toward Negroes from a consideration of their 
descriptive beliefs about Negroes (ie. their 
beliefs about the characteristics and components 
of Negroes), and from the evaluative aspects 
of these beliefs. Specifically, following a pro- 
cedure developed by Maltzman, Bogartz, and 
Breger (1958), 125 Ss listed what they believed 
to be the five characteristics that best described 
Negroes. The ten characteristics of Negroes 
that were most salient for the population, that 
is, the ten most frequent responses given by 
the subjects, were then selected for further 
consideration (e.g, dark skin, curly hair, 
athletic, musical, tall, etc). Although it is 
clear that not all of the beliefs considered are 
"pure descriptive beliefs,” many of them (e.g 
Negroes have dark skin, Negroes have curly 
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hair, etc.) would be considered as “non- 
evaluative” and/or unrelated to attitude by 
most investigators. Two weeks later, 50 of the 
subjects returned for a second session of the 
experiment. Using Fishbein and Raven's (1962) 
evaluative (4) and probability (B) Scales to 
measure attitude and belief respectively (see 
pp. 183 to 189 in this book), each subject rated 
each of the characteristics on the 4 Scale and 
each of the belief statements on the B Scale. In 
addition, all Ss rated the concept “Negro” on 
the A Scale. Using the algebraic formula pre- 
sented above (ie. EB;a;), estimated attitude 
scores were computed for each subject. That is, 
a subject's rating of each belief statement (i.e., 
B,) was multiplied by his rating of the charac- 
teristic that was related to “Negroes” (i.e., aj), 
and these ten products (one for each belief) 
were then summed. In support of the theory, 
the Spearman rank-order correlation between 
estimated and obtained attitudes (i.e., the direct 
evaluation of the concept “Negro” on the A 
Scale) equaled .801 (N = 50, p < .001). 

In other studies, it has been found that a 
leader's attitudes toward the members of his 
group could be predicted from a knowledge of 
his beliefs about the members’ behaviors (i.e., 
his rating of the “probability” or “improbabil- 
ity” that the member “listened attentively to 
others,” “expressed his opinions tactfully,” etc.) 
and the evaluations of those behaviors (Fish- 
bein, 1965). Similarly, in an unpublished study, 
Fishbein and Feldman have obtained evidence 
that a voter’s attitude toward a political candi- 
date (on either the Presidential or the Congres- 
sional level) is a function of his (i.e., the voter's) 
beliefs about the characteristics of the candidate 
(e.g. "he has legislative experience," "he is a 
farmer") and the candidate's stands on various 
issues (e.g., "he is in favor of Medicare," “he is 
in favor of an immediate end to atmospheric 
nuclear testing"), and his (i.e. the voter's) eval- 
uations of these characteristics and issues. 

These findings, together with the previous 
findings of Rosenberg (1956, 1960), Zajonc 
(1954), and others, provide strong support for 
the hypothesis that an individual's attitude to- 
ward any object is a function of his beliefs 
about the object and the evaluative aspects of 
those beliefs. It should be noted, however, that 
these findings do not necessarily support the 
particular theoretical model of the relationships 
between beliefs and attitudes that has been 
proposed here. That is, as was mentioned ear- 
lier, other investigators have arrived at similar 


hypotheses from different theoretical view- 
points. Thus the findings reported above lend 
as much support to their theoretical models as 
they do to the one presented here. The model 
presented here, however, does provide an alter- 
native way of viewing the belief-attitude rela- 
tionship and thus suggests research hypotheses 
that would not necessarily follow from the other 
theories (e.g, the hypothesis that an individ- 
ual's attitude is primarily determined by only 
six to eleven salient beliefs). Although a discus- 
sion of all the implications of the theory is 
beyond the scope of the present paper, a few 
implications are worth considering. 


SOME IMPLICATIONS OF THE THEORY 


The most obvious implications of the theory 
concern the question of attitude change. Ac- 
cording to the theory, attitude change will occur 
when: (1) an individual's beliefs about an ob- 
ject change and/or (2) when the evaluative 
aspect of beliefs about an object change. It 
should be noted that beliefs about an object 
may change in two ways: (1) new beliefs may 
be learned, that is, new concepts may be related 
to the attitude object, new stimulus-response 
associations may be learned, and (2) the strength 
of already held beliefs may change, that is, the 
position of beliefs in the habit-family hierarchy 
may be altered through positive or negative re- 
inforcement. Furthermore, referring back to 
Figure 5B, it can be seen that the amount and 
direction of attitude change will be a function 
of (1) the individual's initial attitude and, thus, 
the number, strength, and evaluative aspects of 
his salient beliefs, and (2) the number, strength, 
and evaluative aspects of the new beliefs he 
learns. Here, however, an important distinction 
must be made between learning the contents 
of an attitude change communication and 
learning something about the attitude object. 
That is, an individual's attitude toward some 
concept will only change if he learns something 
new about the concept, if he forms a new S-r 
association. Simply learning that "the commu- 
nication says $ is 7" will not produce attitude 
change. To use the terminology of Hovland, 


3 It should be noted that learning that "the com- 
munication says $ is 7" can produce delayed atti- 
tude change. That is, over time, the individual 
may forget the source of information and only re- 
call that "S is r.” This "new" belief about S may 
then lead to attitude change. This phenomenon 
of delayed change has been referred to as "the 
sleeper effect" by Hovland, Janis, and Kelley (1953). 
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Janis, and Kelley (1953), attitude change only 
occurs when the individual “accepts” the com- 
munication. Unfortunately, most so-called tests 
of “learning” that are conducted in attitude 
change studies are merely tests of retention or 
recall of the contents of the communication, 
that is, they are measures of the subjects’ beliefs 
about what the communication said, rather than 
tests of whether the subjects have learned the 
S-r associations that the message was designed 
to teach them, that is, measures of the subjects’ 
beliefs about the attitude object. 

In addition, it should be noted that accord- 
ing to the theory, every time an individual 
learns a new belief that associates the attitude 
object with some positively evaluated concept, 
his attitude will change in a positive direction. 
Similarly, if the new belief associates the atti- 
tude object with a negatively evaluated con- 
cept, his attitude will change in a negative 
direction. That is, attitude change, as well as 
attitude per se, is viewed as a function of the 
total amount of affect associated with an indi- 
vidual’s beliefs about the attitude object. In 
contrast to this, most theories based on a no- 
tion of "consistency" would predict that atti- 
tudes and attitude change are functions of the 
mean amount of affect associated with an indi- 
vidual's beliefs According to these theories, if 
an object is associated with other objects that 
are positively evaluated, it is "consistent" to 
have a positive attitude toward it. Similarly, if 
the object is associated with other objects that 
are negatively evaluated, it is "consistent" to 
have a negative attitude toward it. If the ob- 
ject is associated with some objects that are 
positively evaluated and some that are nega- 
tively evaluated, a relatively neutral attitude 
would be consistent. Furthermore, if the object 
is associated with "extremely good things," a 
high positive attitude is consistent; if it is as- 
sociated with "slightly good things" a low 
positive attitude is consistent. From a strict con- 
sistency viewpoint, then, if an object is asso- 
ciated with some “extremely good things” and 
some “slightly good things,” an individual's at- 
titude toward it should be somewhere between 
high and low positive if it is to be consistent. 
Thus, if an individual originally believed that 
the object was associated with “extremely good 
things,” and then learned some new beliefs as- 
sociating the object with “slightly good things,” 
4 A notable exception to this is Rosenberg (1956), 


who bases his theory on a notion of consistency, 
yet postulates cognitive summation. 


according to most consistency theories these new 
beliefs (even though they associate the object 
with positively evaluated things) will actually 
serve to lower the individual’s attitude. This 
prediction is explicitly made in Osgood’s con- 
gruity theory, and implicitly follows from Hei- 
der's balance theory and Festinger's dissonance 
theory. Thus, although the theory proposed 
here views attitude organization and change 
as processes of "cognitive summation," most 
theories based on a notion of consistency view 
attitude organization and change as processes 
of "cognitive balance" or "cognitive averag- 
ing."5 

Finally, it is worth noting that the theory 
also has implications for an understanding of 
the relationships between attitudes and behav- 
ior. In general, psychologists have had little 
success in attempting to predict overt behavior 
of the non-pencil-and-paper type from attitudes. 
At least one of the major reasons for this lack 
of success is the fact that the attitude that is 
measured is usually inappropriate. That is, the 
attitude that is measured is usually an attitude 
toward some concept "X," while the behavior 
that is predicted is Ss behavior with respect to 
some object "x" (a single instance of the gen- 
eral class of X). For example, an investigator 
might obtain measures of an individual's atti- 
tude toward "Negroes" or "Jews" and then 
attempt to predict the individual's behavior 
with respect to a particular Negro or a par- 
ticular Jew. In the classic study by LaPiere 
(1934), restaurant, hotel, and motel owners, 
after giving service "with no trouble" to a Chi- 
nese couple, were later asked, "Will you accept 
members of the Chinese race as guests in your 
establishment?" Over 90 per cent of the re- 
sponses to this question were negative. Thus, 
although these people responded negatively to- 
ward the general concept "Chinese," this was 
not reflected in their behavior toward specific 
individuals within the general class. 

Generally speaking, the attitudes measured 
on attitude scales, at least when dealing with 
attitudes toward specific national and ethnic 
groups, are attitudes based on stereotypes. That 
is, the beliefs about the group that are salient 
for the individual are general characteristics 
(e.g. dark skin, curly hair, musical, athletic, 
lazy, etc.) that serve to define, describe, and 
differentiate the general class of stimuli (e.g. 


5 For a further discussion of the distinction be- 
tween summation (adding) and balance (averaging). 
sec pages 437 to 443 in this book. 
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Negroes) that the person is rating. When a per- 
son is confronted with a specific Negro, how- 
ever, his beliefs are likely to be quite different 
than those that serve to describe “Negroes in 
general.” Because an individual's attitude to- 
ward any object is a function of his salient 
beliefs about the object, it follows that his atti- 
tude toward “Negroes in general" (i.e., the atti- 
tude measured on the attitude scale) is likely to 
be quite different than his attitude toward any 
particular Negro. Clearly, if a relationship be- 
tween attitude and behavior does exist (and 
there are some questions about this assump- 
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C. Problems, Prospects, and Alternatives in Attitude 


Theory 


45. The Current Status of Cognitive Consistency Theories 


WILLIAM J. McGUIRE 


It is a rather common occurrence in the history 
of science for a number of similar theories to 
be put forward contemporaneously by research- 
ers who have little or no direct contact with one 
another. Merton has assembled several hundred 
cases of this sort (eg, Merton, 1957). In the 
1860's about six chemists suggested the essentials 
of the periodic law and Kuhn (1959) reports 
that the principle of energy conservation had a 
dozen different discoverers around the 1840's. It 
need not surprise us therefore that at least a 
half dozen of what we shall call "consistency 
theories" appeared in the psychological litera- 
ture during the 1950's These were proposed 
under various names (balance, congruity, sym- 
metry, dissonance, etc.) but had in common the 
notion that the person tends to behave in ways 
that minimize the internal inconsistency among 
his interpersonal relations, among his intraper- 
sonal cognitions, or among his beliefs, feelings, 
and actions. While resembling one another in 
their use of this fundamental predictive postu- 
late, the various theories differed from one an- 
other in many respects, for example, in ways of 
defining consistency, in the additional assump- 
tions made, and as regards the behavioral realms 
to which the derived predictions are applied. In 
this paper we shall touch upon both common- 
alities and diversities among the various theo- 


* Excerpted from S. Feldman (Ed.), Cognitive Con- 
sistency: Motivational Antecedents and Behavioral 
Consequences, Academic Press, New York, 1966, 
with permission of the author and the publisher. 


ries, and both will be pursued in depth in later 
chapters. 

After the appearance in print of these several 
consistency theories, research stemming from 
them grew rapidly, until currently they seem to 
us to furnish the point of theoretical departure 
for more personality and social psychological 
research than does any other one theoretical 
notion. This paper is restricted to five tasks that 
I shall turn to in the following order: (1) to cite 
the early sources of this flourishing of consis- 
tency theory; (2) to sketch out the diverse di- 
rections it has taken; (3) to review some of the 
unresolved problems and conflicts among con- 
sistency theories; (4) to suggest where consistency 
theory fits within the larger framework of psy- 
chological theory and research; and (5) to de- 
liver a final word of prognostication about the 
future of consistency theory. 


EARLY WORK ON CONSISTENCY THEORY 


As Ebbinghaus (1909) said of psychology as a 
whole at the turn of the century, the currently 
lively theoretical notion of cognitive consistency 
has a long past but a short history. It must be 
admitted that there is some suspicion of descent 
from the medieval notion of logical man or the 
notion of rational man in economics which was 
so popular as a guiding postulate for explaining 
human behavior in the early days of that sci- 
ence. Yet these notions were ambiguously used 
in those contexts. “Rationality” was used to 
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refer sometimes to reality orientation and some- 
times to internal logical consistency. The latter 
is closer to the current meaning of cognitive 
consistency, but currently we use “consistency” 
more in the sense of operating in accord with 
the rules of psycho-logic (Abelson and Rosen- 
berg, 1958) than with those of logic. 


Beginnings of Formal Consistency Theory 

History began with Heider's (1946) paper on 
attitude and cognitive organization in three- 
element structures which posit, for example, that 
if person p likes other person o, and o likes 
object x, then p will tend to like x also. In 
brief, it was argued, there will be a tendency 
towards balance such that in a triad there are 
likely to be either two minus (eg. "dislike") 
links or none. Put more generally, such cycles 
will tend to have an even number of minuses, 
so that the product of the link signs will yield 
a positive value. Newcomb (1953, 1956) applied 
this theory to communicative acts, examined its 
motivational basis, and elaborated it in terms 
of role theory, in his A-B-X model. Cartwright 
and Harary (1956) generalized these three-ele- 
ment systems to structures of any number of 
elements, solving for consistency level by the 
use of graph theory and postulating a number 
of principles that yield predictions about em- 
pirical situations involving interpersonal rela- 
tions. 

Abelson and Rosenberg (1958) developed a 
system using matrix algebra to define the con- 
sistency level in a multielement structure, and 
à series of postulates for applying their notions 
to intrapersonal cognitive systems. While most 
of the earlier researchers dealt with interper- 
sonal situations, the work of Abelson and Ro- 
senberg concerned consistency within one skull. 

Still another intrapersonal consistency ap- 
proach, quite simple in its postulates but with 
wide-range and provocative applications, is the 
theory of cognitive dissonance proposed by 
Festinger (1957) and since then greatly elabo- 
rated (J. W. Brehm and Cohen, 1962; Festinger, 
1964; Festinger and Aronson, 1960). While re- 
maining frustratingly Sparse and informal re- 
garding its specific statement, dissonance theory 
has been exceedingly productive in terms of 
the amount of experimentation it has inspired. 

Two other consistency theories of more di- 
verse content and origin had in common an 
attempt at quantifying the links between cogni- 
tions. Osgood and Tannenbaum (1955) devel- 
oped their congruity model to account for 


changes in evaluations that result from outside 
information linking cognitions that have differ- 
ent valences. They introduced quantification of 
the individual links, the polarity principle, the 
assertion constant, and the correction for in- 
credulity into the theory. McGuire (19604, 
1960c) attempted to define consistency among 
propositions included in the individual's belief 
system and to handle quantitatively the grada- 
tions in assent to each proposition; he also 
attempted to identify sources of cognitive in- 
consistency and the probable foci of changes to 
reduce it. In brief, he has posited that the indi- 
vidual acts as honest broker among contending 
beliefs, seeking a least-squares solution for dis- 
crepancies among them. He concentrates par- 
ticularly on reconciling “logical thinking” 
(minimizing discrepancies between the beliefs 
on a given issue and on logically related issues) 
with “wishful thinking" (minimizing discrep- 
ancies between belief and desire on each in- 
dividual issue). One's least-squares solution, he 
argues, may favor the logical or the wishful 
realm of consistency, depending on situational 
manipulations of the salience of the two (Mc- 
Guire, 19605). 

Another source of cognitive consistency theo- 
rizing is the trichotomy as regards the existential 
stances man can take—feeling, knowing, and 
acting form a trichotomy that goes back at least 
to Plato. In psychology, Allport's old (1935), but 
still respectable, review of the attitude area 
attributed to the ancients the perennial analysis 
of attitudes into three corresponding compo- 
nents: affective, cognitive, and conative. Many 
students of attitudes have developed this notion 
in detail (e.g, D. T. Campbell, 1947; Chein, 
1948; Doob, 1947; Harding, Kutner, Proshan- 
sky, & Chein, 1954; Krech, Crutchfield & Bal- 
lachey, 1962). The terminology has changed and 
the level of sophistication in the research has 
varied, but interest in the analysis has been a 
persisting one. Recently, Fishbein (e.g., 1963) 
has brought much precision to the area. The 
work of Rosenberg (1960a, 1960) on this prob- 
lem is also noteworthy for its imaginativeness. 


DIRECTIONS TAKEN BY CONSISTENCY 
THEORY 


Those of us who have found the consistency 
formulation useful in our work have used it to 
raise and to suggest answers for a wide variety 
of questions. In this section we propose to €x- 
hibit the heuristic power of this formulation by 
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reviewing a wide range of the types of problems 
towards which attention has been attracted by 
consistency theory. First, we shall discuss no- 
tions of how, if we have the tendency towards 
consistency, there is so much inconsistency in 
our experimental subjects to start with. Next, 
we shall review experimental procedures used 
to bring these consistency needs to bear more 
fully on a domain of cognitions. Third, we shall 
survey the various modes of consistency reduc- 
tion that have been subjected to experimental 
study. Fourth, we shall discuss findings regard- 
ing the relative likelihood that consistency will 
be restored by changing one kind of cognition 
rather than another. A fifth topic for review 
will be the types of consistency that seem to be 
preferred. Finally, we shall consider predictions 
derived from consistency theory regarding com- 
munication, persuasion, and action effects. 


Origins of Inconsistency 


It may seem paradoxical or even embarrassing 
for the theory that we posit a strong tendency 
towards consistency and yet always manage to 
find sufficient inconsistency within the system 
to allow predictions to be made and tested re- 
garding modes of resolving inconsistencies. It 
may be worthwhile, then, to list a half-dozen of 
the ways in which inconsistencies enter into the 
cognitive system in the first place and in which 
they are maintained there. 

First, there may be inconsistencies among the 
person's cognitions because of methodological 
artifacts, such as acquiescence response sets, that 
cause him to agree with both a statement and 
its contradictory in "balanced" inventories 
(Couch and Keniston, 1960; O'Donovan, 1965; 
Rorer, 1965); likewise, context effects, such as 
those dealt with in assimilation-contrast theory 
(M. Sherif and Hovland, 1961; C. W. Sherif, 
M. Sherif, and Nebergall, 1965), can lead him 
into mutually inconsistent positions. Second, 
there are the logical shortcomings of the human 
cognitive apparatus that lead to material falla- 
cies (the ambiguous "some," amphibology, etc.) 
or to formal fallacies (such as illicit process of 
the major, atmosphere effect, experimental war- 
iness, a. fortiori effect, etc.) that have been re- 
viewed by McGuire (1960c). 

"Third, the fault may lie in the stars and not 
in ourselves. The notion of a society free of 
inherent contradictions is going out of fashion. 
Cognitions that reflect an inconsistent social 
world (e.g., see Myrdal, 1944, on the American 
Dilemma) must be internally contradictory. 


People are cast in conflicting roles. They are 
called upon to serve God and man when the 
two may be at odds. The public safety officer at 
a time of disaster is torn between his duty to 
the broader public and to his own family. And 
even if the universe is inherently consistent in 
the eyes of God, man sees only a sample of in- 
formation and so, unaware of the higher syn- 
thesis (cf. Abelson, 1959, p. 346), he may be left 
with a contradiction in his partial and even 
biased sample. 

Fourth, the world may change, leaving the 
person encumbered with conceptual baggage 
that no longer accords with reality. He may 
labor to put his political leader into office only 
to see the soughtafter patronage plum lifted 
past his waiting hands into those of the van- 
quished opponent as a gesture of reconciliation 
or to witness his victorious champion adopt the 
despised policy of the opposition to gain a con- 
sensus. Or, fifth, he may have been forced by 
external pressure to behave in ways inconsistent 
with his principles and be left with the prob- 
lem of reconciling his behavior with his ideals. 
Or, sixth, he may be presented with new infor- 
mation, via persuasive communications, inter- 
personal interactions, or direct experience with 
the object that does not coincide with his pre- 
vious beliefs. 


Ways of Bringing the Consistency Need to 
Bear on a Given Cognitive Realm 


Given that a certain degree of inconsistency 
has entered the cognitive system and maintained 
itself by some mechanism such as has been dis- 
cussed in the preceding section, how can the 
person be made to feel the pressure of the con- 
sistency need so as to shift to a new quasi- 
stationary equilibrium of higher internal con- 
sistency? A number of procedures towards this 
end have been used experimentally and all 
probably have their real-life analogues. For ex- 
ample, in some studies, the need for consis- 
tency has been brought to bear by the forced 
recall of conflicting cognitions (Abelson and 
Rosenberg, 1958; Festinger, 1957; McGuire, 
1960a). Another procedure (one which some of 
us find methodologically unattractive) is to have 
the person "role-play" certain cognitive constel- 
lations (Jordan, 1953; Morrissette, 1958; Rosen- 
berg and Abelson, 1960). Still another method 
is to force the individual to make some kind 
of behavioral commitment (J. W. Brehm and 
Cohen, 1962) or to confront him with appropri- 
ate communications (Newcomb, 1953). Or, as 
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Abelson and Rosenberg (1958, p. 5) have con- 
jectured, there may be a personality dimension 
with people at one pole which enables them to 
adopt as an intellectual style the reexamination 
of their own positions. For them, and for all of 
us to a degree, the unexamined life may not 
be worth living. No doubt we can stimulate this 


need for self-examination by suitable external - 


manipulations (Glass, in press). 


Modes of Resolving Cognitive Inconsistencies 


The individual who is confronted with a 
cognitive inconsistency has many possible alter- 
native responses to the situation. Here, we can 
take the time to briefly review a few of these 
modes of responding (others are described in 
D. T. Campbell, 1961). It should be understood 
that the use of one of these modes is more 
likely to be complementary to, rather than 
mutually exclusive of, use of the others. 

First, one can always stop thinking, repress 
the matter, put the inconsistency out of mind, 
as Freud and St. Augustine (not necessarily in 
that order) have suggested—as have Dollard and 
Miller (1950, pp. 203-204), Newcomb (1953, 
pp. 400-401), Abelson and Rosenberg (1958, 
p. 5), and Harary (1959). 

Another mode is “bolstering” (Abelson, 1959, 
.p. 345; Festinger, 1957, pp. 21-22) which 
does not remove the inconsistency, but sub- 
merges it among a larger body of consistencies 
so that it looms relatively less large. The per- 
son using this mode, when confronted by a 
contradiction between one of his beliefs and 
another, amasses a great body of other beliefs 
that are consistent with the conflicting one. Or 
there is the somewhat related "mote" method 
where the believer, faced with his own incon- 
sistency, draws solace in pointing out that other 
people are more inconsistent still, as did the 
Moscovite who was showing the American vis- 
itor the beauty of the mosaics in his home-town 
subway. When the American said, "Very nice 
indeed, but the trains? There seem to be no 
trains on these tracks," the Moscovite replied, 
"Yes, and what about the plight of the Negro 
in the South?" 

Two other modes are the somewhat antithet- 
ical processes that Abelson (1959, pp. 345-346) 
calls "differentiation" and "transcendence." For 
example, the man who loves the Bible and also 
loves the theory of evolution, on becoming 
aware that his two loves are not in complete 
agreement, might try differentiating. For exam- 
ple, he might distinguish between the figurative 


Bible and the literal Bible and then decide he 
loves the former, while it is only the latter that 
conflicts with evolutionary theory. Or he might 
try transcendence: he would decide the Bible 
and evolution are the two faces of a higher 
reality, subsumed under a dynamic equilibrium 
of opposites. Those who find this process of 
Hegelian synthesis hard to follow are urged to 
use another mode of conflict resolution, or to 
see Abelson (1959) or Weil (1951), for further 
instructions. 

Still other modes touch on Asch's (1940) no- 
tions about attitude change involving changing 
the object about which the opinion is held 
rather than the opinion about the given object. 
For example, the college student who moves up 
the status of "politican" after being told his 
peers ranked the profession much higher than 
he, may merely be redefining "politician" as 
meaning “statesman” after hearing their high 
ranking, whereas he had at first taken it to 
mean "ward-heeler." Alternatively, he may in- 
deed resolve the dilemma of peer disagreement 
by devaluating the peer or by actually changing 
his opinion about the object, as well as by 
changing the object he is giving his opinion 
about. A final mode that we shall mention is 
devaluating the task when the inconsistency is 
noted. Both Newcomb (1953) and Festinger (e.g. 
1957, p. 22) have suggested that, when con- 
fronted with an inconsistency, the person will 
tend to make it more tolerable by downrating 
the importance of the conceptual area in which 
it occurs. 

In addition to all these modes of responding 
that reduce the inconsistencies, there is always, 
of course, the alternative stance of simply bear- 
ing it. Indeed, one may make a virtue out of it. 
We can say with Carlyle that consistency is the 
hobgoblin of little minds; or adopt the credo 
quia absurdum and find solace in contradictions 
such as Tertullian's irascible notion that the 
Christian creed is impossible and therefore it 
is evident, or Pascal's admission that the exis- 
tence of God is incredible and therefore one 
believes. (But let us not claim for the West any 
exclusive possession of this profundity. It is 
written somewhere in Sikh scripture that if 
there be found any two passages in that work 
that are mutually contradictory, they are both 
true.) 

But the fact that the layman or even the sci- 
entist seems able to tolerate a certain amount of 
ignorance and even inconsistency in his cogni- 
tive arena does not invalidate the general prop- 
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osition of the consistency theorists that incon- 
sistency reduction is reinforcing in much the 
same sense as pain reduction is reinforcing. The 
latter drive also is far from absolute. We all 
learn to tolerate pain and carry on our work 
even when bothered by a moderate degree of 
pain, when this must be endured because we 
lack the response possibility to elude it or 
choose to allow it as the lesser of evils. Indeed, 
we even seek out pain under certain circum- 
stances, as evinced by our proceeding frequently 
if unenthusiastically to the dentist [save excep- 
tions like Silvan Tomkins (1963), who is not 
bothered by dental pain]. Here pain is sought, 
at least in part, because of the need to avoid 
greater pain, and thus is supportive evidence 
for, rather than against, the existence of a pain- 
reduction drive. Hence, we can admit that tol- 
eration and continuance is indeed one mode of 
responding to inconsistency, without deserting 
our proposition that there is a drive to reduce 
inconsistency to low or even zero levels. 

The eight or so responses to inconsistency 
that we have reviewed here—most of which 
serve to provide the protagonist an at least ap- 
parent reduction in his inconsistency—indicate 
a serious problem for consistency theorists. The 
availability of these many alternative modes is 
quite convenient for the person confronted with 
an inconsistency among his cognitions, but it 
is an embarrassment for the theorist trying to 
predict what the person will do. It is unclear 
even whether we should expect the use of the 
alternative modes to be positively or negatively 
correlated. If the persons are in a "zero-sum" 
situation where everyone has a given amount 
of inconsistency, then a negative correlation 
would be expected, the use of one mode serving 
to reduce the inconsistency so that use of other 
modes would be less necessary. But we usually 
cannot assume such a uniform level of internal 
inconsistency. On the contrary, we usually must 
work backwards, and if the individual makes 
more use of a mode than do others, it is usually 
a sign that he has experienced more internal 
inconsistency and will thus find it necessary to 
make greater use of alternative modes as well 
(Steiner and Johnson, 1964). The consistency 
theorist, in this difficult situation, typically tries 
to measure one or a few modes of reduction 
while blocking off the use of other modes, either 
a priori or a posteriori. In either case he leaves 
himself open to criticism. If he uses a priori 
tactics—such as elaborate inductions that at- 
tempt to restrain the subject within certain 


conceptual limits (e.g., Festinger and Carlsmith, 
1959)—for closing off some modes, he lays him- 
self open to alternative explanations and the 
charge that his experimental conditions were so 
complex as to be unreplicable (cf. Rosenberg, 
1965). If he attempts a posteriori closing off, 
such as by eliminating subjects who seem to 
use other modes than the one under study (cf. 
Festinger, 1957, pp. 211 ff), then he becomes 
liable to charges of selecting his evidence and 
obscuring the generalizability of his results (cf. 
Chapanis and Chapanis, 1964, pp. 12-17). Some 
theorists are proceeding to handle this problem 
by examining individual difference factors 
(Glass, in press; Steiner and Rogers, 1963) or sit- 
uational factors (Brock and Buss, 1962) that al- 
low predictions that one mode or another will be 
the preferred route to inconsistency reduction. 


Relative Likelihood of Various Aspects of 
the Cognitive System Changing in 
Case of Inconsistency 

Given that an inconsistency exists between 
elements and that the person responds by 
changing his cognitions, there are several de- 
grees of freedom regarding what might be 
changed to restore balance. For example, if 
one’s expectation and his wish regarding a 
future contingency are in conflict, the person 
can bring them into line by adjusting his ex- 
pectation or his wish or both. Or, if there is 
an imbalanced triad, such that A likes B and 
X, but B dislikes X, balance can be restored by 
A changing his attitude toward B or toward X, 
or by B changing his toward X. Consistency 
theorists have offered a wide range of sugges- 
tions regarding which elements in the system 
are most likely to be changed in order to restore 
consistency. The range of these suggestions and 
findings will be indicated here by mentioning 
a dozen or so of the relevant variables. 

One of the best known predictions regarding 
which cognition is most likely to change in an 
inconsistent system is the Osgood and Tannen- 
baum (1955) polarity principle. In accordance 
with their postulate, if two related concepts are 
incongruously valenced, they will shift toward 
each other, with the less polarized cognition 
(i.e the less extremely valenced one) shifting 
the more. The empirical validity of this postu- 
late is as questionable (Rokeach and Rothman, 
1965) as its fame is widespread. A second sug- 
gestion by Osgood and Tannenbaum is the as- 
sertion principle which states that if a source 
makes an incongruous assertion about a concept 
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they are adjusted toward one another with the 
concept shifting more than the source. (If you 
do not believe these two theorists, then they will 
even give you a constant—+.17 in the usual 
seven-step semantic differential scale—for this 
differential.) Perhaps this can be generalized to 
say that in a discrepant subject-object pairing, 
more adjustment will be made on the latter 
than the former. , 

Two principles that are not logically contra- 
dictory to one another if read closely, but will 
in practice tend to make opposite predictions, 
have been suggested by Abelson and Rosenberg 
(1958) and McGuire (1960a). According to the 
former, the cognition least cathected with affect 
is the one most likely to change. According to 
McGuire, the more distorted (e.g, by wishful 
thinking) cognition will change the most. These 
two positions will tend to conflict, since the 
most affectively charged cognition will usually 
be the most distorted. The Abelson and Rosen- 
berg prediction will usually agree with that 
made from the Osgood and Tannenbaum po- 
larity principle; McGuire (1960a) offers some 
post factum support for his prediction. Both 
positions could be put to a test rather easily 
by an appropriately designed experiment. 

Cartwright and Harary (1956), Festinger (1957, 
p- 27), Abelson and Rosenberg (1958), Rosen- 
berg (1960a, pp. 59-64), and Zajonc (1960, pp. 
293-295) have all argued that the least “bol- 
stered” cognition (ie. the one most isolated 
from relational ties to other cognitions and 
needs) is the one most likely to change. We 
have here a least-effort notion. The person tries 
to avoid a change to reduce one cognitive in- 
congruity that might introduce more imbalance 
than it removes, In addition, Festinger suggests 
that in working out this “bolstering” principle 
we should multiply the number of cognitive 
ties by their importance. J. W. Brehm and Co- 
hen (1962, pp. 201-220) emphasize that we must 
also consider the degree of volition the person 
exercised in the establishment of the tie. Many 
of the dissonance people (e.g., J. W. Brehm and 
Cohen, 1962) have pointed out further that we 
must also consider how irrevocably committed 
the person is to each of the cognitions. 

The Michigan group have demonstrated that 
when balance can be restored by changing either 
instrumentality or goal (i.e., either the person's 
perception of the utility of a given cognition 
for achieving some goal or his perception of the 
attractiveness of the goal itself), the perceived 
instrumentality is the more likely to change 


(Carlson, 1956; Rosenberg, 1956). A somewhat 
related position given in terms more appropri- 
ate to the Osgood and Tannenbaum and the 
Abelson and Rosenberg formulations is that the 
sign of the relationship between concepts is 
more likely to change than the sign of the con- 
cepts themselves. We believe these various for- 
mulations could be summarized by postulating 
that the valence of the links between concepts 
is more likely to change when one of the con- 
cepts is not "ego" than when ego is involved 
as one of the concepts. 

Jordan (1953) and Harary (1959) offer the 
suggestion that, in a system of three or more 
cognitions or individuals, when balance can be 
restored either by making a positive sign nega- 
tive or a negative sign positive, the latter will 
be preferred. 

McGuire (19602) has argued that when we 
elicit the person's opinion on a series of issues 
on which he holds inconsistent positions, our 
questioning produces a “Socratic method” effect 
such that the opinions shift toward greater mu- 
tual consistency on a later occasion for having 
been elicited in close temporal contiguity ear- 
lier. He has argued (McGuire, 1960a) that the 
shift is greater on the opinions listed earlier in 
the series than on the ones elicited later. In 
another study (McGuire, 1960b) he has pre- 
sented some evidence that when a persuasive 
communication argues in an inconsistency- 
increasing direction, the impact is absorbed on 
unmentioned but logically related issues as well 
as on the explicit issue. Immediately after the 
message the impact is mainly on the explicit 
issue, but as time passes, relatively more of the 
effect filters down to the remote issues. 

"This list by no means exhausts the proposed 
answers to the question of "which cognitions 
change more" when internal consistency is be- 
ing restored. Some of the hypotheses have re- 
ceived substantial empirical confirmation; none 
can as yet be ruled out on the basis of available 
evidence. The range of these hypotheses and 
their somewhat untidy hodgepodge appearance 
give testimony to the wide-ranging investigation 
stemming from the diversity of the various con- 
sistency theories. 


Relative Preference. for 
Different Types of Consistency 


A system of cognitions can be in perfect bal- 
ance in a variety of ways. The system can, for 
example, always be put in vacuous balance by 
deleting all the relationships among the cogni- 
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tions. A completely fractionated man who keeps 
each of his cognitions in a separate logic-tight 
compartment will never be bothered with cog- 
nitive inconsistency. At the other extreme, we 
might have a man who has perfect cognitive 
consistency by being a combination of William 
Wordsworth, seeing everything as connected to 
everything else, and Dr. Pangloss, seeing every 
relationship between things as positive. Such a 
person's cognitive system would be in a com- 
pletely connected, and a perfect positive bal- 
ance. Alternatively, the person could be in a 
partly connected, perfect positive balance; or 
he could be in perfect balance with the right 
mixture of positive and negative relations, 
either completely or partially connected. 

It seems generally agreed on theoretical and 
psychological grounds that connected balance 
is preferred psychologically to unconnected 
balance, and unconnected structures tend to- 
wards connected balance (A. Campbell, Con- 
verse, Miller, and Stokes, 1960; Cartwright and 
Harary, 1956; Horowitz, Lyons, and Perlmut- 
ter, 1951; Jordan, 1953; Newcomb, 1961). It 
has also been suggested (Jordan, 1953) that 
positive balance is preferred to negative or to 
mixed positive and negative balance. 

We have discussed these tendencies to posi- 
tivity and to completeness as possible paths 
toward balance. However, in so far as there 
are such tendencies to a pronounced degree, 
they contain a possibility of conflict with the 
tendency toward consistency. We feel this area 
is a rather interesting one, and that it offers a 
fertile field for future research. 


Resolving Inconsistencies by Communication, 
Opinion Changes, and Actions 


What are the uses of consistency theory? 
What realms of behavior of traditional psycho- 
logical interest does it help us predict? Even 
more important, are there new realms of be- 
havior, previously neglected by psychology, 
that consistency theories open to scientific pre- 
dictability? The answers to these questions are 
so rich that one is tempted to say simply, "yes," 
and go on to the next topic. Rather than be 
embarrassed by such riches, we choose to re- 
view briefly some of the implications of con- 
sistency theories chosen from just three of the 
many areas to which it has been applied. 

The first area of application is to communi- 
cation behavior, the question of who sends mes- 
sages to whom. The early work of the Lewin 
group, though undertaken with different theo- 


retical inspiration, fits nicely into the consis- 
tency theory paradigm (Festinger and Thibaut, 
1951; Festinger, Schachter, and Back, 1950). 
The classic "deviation" experiment of Schach- 
ter (1951), which perhaps represents the high- 
point of Lewinian research, suggests still 
further directions that consistency theory could 
take. Newcomb (1953), particularly, has con- 
cerned himself with the communication impli- 
cations of consistency theory. The traditional 
areas of selective avoidance and selective expo- 
sure have likewise been investigated in much 
detail under the inspiration of consistency 
theory, particularly by dissonance people 
(Adams, 1961; Brock, 1965; Mills and Ross, 
1964; Mills et al., 1959; Rosen, 1961; Sears 
and Freedman, 1965). This is one area, how- 
ever, where refutations of the theories have 
come at a rapid rate (J. W. Brehm and Cohen, 
1962, pp. 48-49, 51-52, 92-97; Steiner, 1962). 

The notion of proselytizing after the discon- 
firmation of a belief is a particularly intriguing 
notion of consistency theory. This is the pre- 
diction that if a person has irrevocably com- 
mitted himself by action on the basis of a 
belief (eg, he takes public and costly action 
because he believes the world is going to end 
on a certain day) and then the belief is dis- 
confirmed (the day comes, but not The End), 
then he rather paradoxically proselytizes for 
his belief more avidly after the disconfirmation 
than before, in order to convert others and 
thus bolster his sagging confidence in his wis- 
dom in having committed himself to this be- 
lief. This notion would explain some historical 
occurrences of more than a little importance 
that have puzzled many. It is an appealing 
proposition and deserves to be true. So far, 
however, the empirical evidence for it has not 
been overwhelming (Brodbeck, 1956; Festinger, 
Riecken, and Schachter, 1956; Hardyck and 
Braden, 1962). 

A second area to which consistency theory 
has been applied with considerable success is 
that of attitude change. We may here review 
only a very skimpy selection of the large body 
of research in this area. McGuire (19602) has 
shown that opinions can be changed by the 
Socratic method of simply asking questions on 
related issues, sensitizing the individual to the 
fact that on the basis of his own premises, his 
conclusions do not follow. This sensitization 
has proved sufficient to produce a significant 
trend toward greater internal consistency when 
the same beliefs are later tapped a second time. 
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McGuire (1960b) has also shown that the per- 
suasive impact is felt, not only on the explicit 
issues, but also on unmentioned logically re- 
lated issues. He has shown (1960c), further, 
that the persuasive impact is greater when the 
message argues in an inconsistency-decreasing 
direction than when it increases consistency, 
and that there tends to be a delayed-action 
persuasive effect on beliefs logically related to 
the ones with which the persuasive message 
explicitly deals. He has also studied the impli- 
cations of consistency theory regarding ways of 
inducing resistance to persuasion (McGuire, 
1964). 

Another persuasion area to which consistency 
theories have been applied with a stimulating 
degree of theoretical controversy and an attrac- 
tive amount of empirical ambiguity has been 
that of the discrepancy-involvement question. 
How does the amount of obtained opinion 
change vary as a function of the amount of 
change urged, and how does this relationship 
vary as the believer's “involvement” increases? 
There seem to be opposite predictions on the 
basis of dissonance theory (Zimbardo, 1960) 
and of assimilation-contrast theory (M. Sherif 
and Hovland, 1961; C. W. Sherif et al., 1965). 
Zimbardo (1960) has proposed a plausible 
reconciliation, but the issue is still in doubt 
(Bergin, 1962; Freedman, 1964; Greenwald, 
1964). 

A final area of application of consistency the- 
ory that we shall review briefly is to the rela- 
tions of belief and action. It was long believed 
that, at least to some extent, belief was a deter- 
minant of action. Consistency theories (particu- 
larly the dissonance formulation of Festinger 
and his followers) have called attention to the 
frequent occasions on which this direction of 
causality is reversed and the person's action 
produces a belief to justify it. Instead of the 
classical actio sequitur esse, we have "as the 
person acts, so he becomes," 

At the moment there is a lively controversy 
between a conflict theory (Janis, 1959), which 
Suggests that the cognitive resolution occurs 
before the decision, and dissonance _ theory 
(Festinger, 1964), which suggests that the cog- 
nitive adjustments are postdecisional. So far, 
the dissonance theorists seem a little ahead, 
but the game is in the early innings and the 
country is yet to be heard from. Another quite 
lively scene in this same area involves inter- 
nalization of forced compliance as a result of 
role-playing belief in a position that one does 


not at the time really accept. From Kelman 
(1953), through Janis and King (1954) and the 
dissonance theorists (J. W. Brehm and Cohen, 
1962, pp. 117-121, 248-258), this has been a 
fertile field of inquiry, and quite suggestive 
regarding cognitive functioning. Equal activity 
is found in the area of postdecisional reorgani- 
zation following choice among alternatives; this 
area was opened by J. W. Brehm (1956; 
Brehm & Cohen, 1962). As one other of the 
less investigated but ingenious notions about 
beliefs and action investigated by consistency 
theorists, we might mention the Aronson and 
Mills (1959) study on attachment to a group 
as a consequent of traumatic initiation into it. 
The current state of the question in a number 
of the areas just reviewed has been summarized 
recently by McGuire (1966). 

Clearly, the consistency theorists have been 
moving ahead on many fronts. It will come as 
no shock, though, when I say not all questions 
have been answered to everyone's satisfaction. 
What are some of these unanswered questions 
that confront consistency theories today? A re- 
view of some of these outstanding issues will 
constitute our next section. 


ISSUES CURRENTLY CONFRONTING 
CONSISTENCY THEORY 


A number of issues and points of disagree- 
ment have arisen among consistency theorists 
which have stimulated work in the past and 
will undoubtedly provoke research for some 
time to come. Some of these issues constitute 
unresolved puzzles for all the theories; on other 
issues, conflicting solutions have been proposed 
by the various theories. We shall in this section 
review a wide range of such issues. Some of 
these arise in connection with all or most of 
the approaches; others are relevant to only a 
few of our approaches. Also the several issues 
differ widely as to relative importance. We 
have attempted to cast a wide net in this re- 
view of unresolved problems. For convenience, 
we have listed them under three headings: defi- 
nitional, empirical, and methodological prob- 
lems.1 


1 In November, 1963, four of us who participated 
in this symposium (Abelson, Aronson, Rosenberg, 
and myself) plus two others (Percy Tannenbaum 
and Theodore Newcomb) met at Newcomb's house 
in Ann Arbor, Michigan, to review problems in 
consistency theory—especially those issues on which 
we felt our respective theories might yield different 
predictions. The following month, I wrote a review 
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Issues Involving Definitions 


We include here some outstanding questions 
regarding the elements, cognitive or otherwise, 
to which consistency theory should be applied. 
Besides those regarding definitions of the units, 
questions arise regarding the relationships 
among those units that are to be considered. 
And, as always in the empirical sciences, we 
are groping toward exactitude; therefore prob- 
lems of definition lead to problems of quantifi- 
cation. 


Interpersonal versus intrapersonal units. 
Some of the consistency theories (eg. that of 
Cartwright and Harary, 1956) have been ad- 
vanced to apply to interpersonal relations in 
social systems, so that the units they deal with 
are persons and the relations whose consistency 
is being considered are, e.g., the likes and dis- 
likes among these people. Other consistency 
theories (e.g., those of Abelson and Rosenberg, 
1958; Festinger and Aronson, 1960; Fishbein, 
1963; Fishbein, 1965; McGuire, 19606); Mc- 
Guire, 1960c; Osgood and Tannenbaum, 1955) 
apply to cognitions within persons, so that the 
units are concepts or propositions and the rela- 
tions are, eg. implications or other logical 
relations among these intrapersonal cognitions. 
Some of these intrapersonal systems deal with 
the consistency among cognitions on related 
issues (McGuire, 1960c); others deal with con- 
sistency of different aspects of the cognitions 
on a single issue (Fishbein, 1963; McGuire, 
1960a; Rosenberg, 1960b). Other consistency 
theorists (Newcomb, 1959; Heider, 1958) deal 
with mixed interpersonal and intrapersonal 
systems. 

It seems likely that the differentappearing 
mathematics advanced for these two domains 
[e.g Hararys graph-theoretical approach (cf. 
Cartwright and Harary, 1956; Harary, 1959) 
and Abelson's matrix algebraic approach (cf. 
Abelson and Rosenberg, 1958)] are isomorphic, 
but the point could bear further scrutiny. Even 
if the various mathematical formulations are 
equivalent, it is highly probable that the two 
domains differ substantially as regards the psy- 
chological and sociological propositions needed 
to predict tolerance for inconsistency, effects 
of inconsistency, and the preferred modes of 


of the problems in the area (McGuire et al., 1963), 
based partly upon that Ann Arbor conference. That 
review has been circulated in preprint form, but 
to make it more generally available, I shall repeat 
here most of what I said in that informal report. 


consistency restoration. Analogies might be 
useful in crossfertilizing both applications of 
the model; eg. restoration of consistency in 
the contentious social system by schism into 
noncommunicating factions seems analogous to 
the individual's restoring consistency to his 
cognitive system by segregating sets of beliefs 
into logic-tight compartments. An examination 
of the means, efficacy, and problems involved 
in each analogue should be suggestive regard- 
ing the other analogue. 

Definition of negative relations. The basic 
problem of defining a negative relationship 
between two units remains unsettled in a num- 
ber of regards. It seems clear that "psycho- 
logical implication is much broader than 
logical implication (Rosenberg and Abelson, 
1960). Festinger's unfortunate choice of "ob- 
verse" (1957, p. 13) is illustrative of the con- 
fusion in the field. Whether a cognition should 
be opposed with its contradictory or contrary 
for best empirical applicability has been raised 
by a number of theorists (Cartwright and 
Harary, 1956, pp. 280-281, 290-291; Heider, 
1946, p. 107), but remains unanswered. The 
question has also been raised (cf. Harary, 1959, 
p. 317) whether or not any relations can be 
defined per se as negative: eg., Deutsch 
(Deutsch and Solomon, 1959; Deutsch, Krauss, 
and Rosenau, 1962) has suggested that for a 
person with a negative self-concept, another 
person's dislike for him would be a positive 
relationship (cf, also, Heider, 1946, p. 111; 
Heider, 1958, p. 210). Studies need to be de- 
signed to clarify and answer these questions. 

Types of relationship. It has been argued, 
particularly by those who apply consistency 
theory to mixed inter- and intrapersonal struc- 
tures, that in many cases the relationships in- 
volved in the structure should be partitioned 
into two or more subsets with the consistency 
measure having to be applied separately to 
each subset. Heider's (1946; 1958) type U and 
L relationship is an example. The question 
mentioned above, of whether the negation of 
relation should, psychologically, be its contrary 
or its contradictory, may have to be resolved 
differentially for different types of relations. 
Cartwright and Harary (1956) have suggested 
type-t graphs as a method of handling systems 
involving different classes of relations. Both 
psychological and mathematical questions need 
consideration before we can make a beginning 
here. 

Another problem involves bi-directional rela- 
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tionships, which arise particularly (though not 
exclusively) in connection with interpersonal 
links: The bond from A to B may be "likes" 
and that from B to A may be "dislikes." Such 
asymmetrical links provide much drama in hu- 
man relations, actual and fictional, and much 
technical difficulty for consistency theorists. 
The mathematics of digraphs has been sug- 
gested (Cartwright and Harary, 1956) and 
Abelson's matrix method (cf. Abelson and 
Rosenberg, 1958) seems capable of handling 
the problem by allowing noncorresponding 
signs above and below the diagonal. 

Quantification issues. As in almost every 
other behavioral science, problems of measure- 
ment abound in consistency theory and the 
state of the art as regards scaling is pitiful. 
We shall restrict mention here to quantification 
issues that are critical for our contending con- 
ceptions of basic models of how humans operate. 
For example, in Festinger's "dissonance" ver- 
sion of consistency theory, the tension or pres- 
sure to change as a result of inconsistency is 
defined in terms of the ratio of dissonant to 
total (or sometimes to consonant) elements 
(1957, p. 17). Obvious alternative definitions 
are suggested in early works of Kurt Lewin 
(e.g. 1951) and Neal Miller (e.g., 1944) on con- 
flict resolution where the resulting tension is 
an intervening variable defined in terms of 
simply the magnitude of the weaker force or 
of the absolute difference (rather than ratio) 
between the two forces. Consistency theory ex- 
periments could be designed, we hope, to dis- 
tinguish between these formulations as regards 
empirical relevance, thus providing findings 
whose implications would extend outside con- 
sistency theory. A possible resolution would be 
to develop an index that reflected both the 
relative and absolute level of the inconsistent 
cognitions (cf. J. S. Brown and Farber, 1951, 
pp. 483-486; Worell, 1962). 

Another quantification problem that needs 
study has to do with the strength of the indi- 
vidual links in the structure. Most current 
theories (Abelson and Rosenberg; Heider, Cart- 
wright and Harary; Newcomb; etc.) restrict the 
system to three-valued links (+1, 0, —1). Fes- 
tinger has, in principle, the beginnings of more 
precise quantification in the “weighting by 
importance” notion. Osgood and Tannenbaum 
allow, in effect, a seven-point scale in their 
evaluation of the individual concepts on a plus- 
three to minus-three scale. McGuire in prin- 
ciple allows a continuous scale in his 


probabilistic gradations of assent (1960c, p. 75), 
and has recently also developed an IPL-V pro 
gram evaluating consistency in structures in 
which the links can have any value between 
plus one and minus one—in effect, a continu- 
ous scale. The issue is not just a question of 
precision, but a fundamental question of 
whether organisms behave as if they operate 
on continuous or discrete scales, in infinite 
gradations or quantum jumps, as an analogue 
computer or as a digital computer. It is con- 
ceivable that experiments be designed to test 
between such alternative depictions, which 
have implications far beyond consistency 
theory. ? 

Still another quantification problem is that 
all the present consistency models posit a mul- 
tiplicative basis for determining the signs of 
cycles; i.e., they assume that two minuses make 
a plus, or, psychologically speaking, that hu- 
mans operate on the principle that "my enemy's 
enemy is my friend." However, an alternative 
additive model is conceivable, in which two 
minuses would make a still more minus result; 
ie, that humans say in effect, "if even this 
creature A, whom I hate, hates B, then I would 
find B doubly odious.” Further examination 
might yield a solution such that, in some do- 
mains of behavior, the multiplicative model is 
more appropriate, and in others, the additive 
one. 

A similar-sounding, but quite different ques- 
tion is the additive versus averaging con- 
troversy, regarding resolution of cognitive 
conflict. For example, given that a stimulus 
person is described by several exceedingly good 
traits, what is the effect of ascribing to him 
several additional traits that are mildly good? 
Is an additive model correct, such that the 
additional (mildly) positive traits result in his 
being even more favorably evaluated (Abelson, 
1961; Fishbein and Hunter, 1964; Gulliksen, 
1956; Hammond, 1955; Triandis and Fishbein, 
1963)? Or is the averaging model more appro- 
priate, so that adding the only mildly positive 
traits results in an averaging out to a lower 
net evaluation than would the extremely posi- 
tive trait alone (Anderson, 1962; D. T. Camp- 
bell, 1961; Feldman, 1962; Osgood and 
Tannenbaum, 1955; Rimoldi, 1956; Weiss, 
1963)? The question is an attractive one in 
that it is accessible to precise empirical study 
which promises an answer that will improve 
the question. As might be expected, there is 
evidence of both trends and a sophisticated 
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reconciliation seems in the offing (Anderson, 
1965). 

A final quantitative problem that we shall 
advert to here is that of weighting certain sub- 
structures in determining the inconsistency 
level. For example, it has been suggested by 
several theorists that cognitive cycles should be 
somehow weighted inversely as their length, 
on the plausible conjecture that an inconsis- 
tency in a long chain of cognitive associations 
(or of social relations) is psychologically more 
tolerable than one in a short chain, and thus 
yields less pressure to change. Alternative 
weighting procedures have been suggested by 
Cartwright and Harary (1956, p. 289). If we 
adopt McGuire's (1960c) procedure, mentioned 
above, of scoring the individual link on a con- 
tinuous plusone to minus-one scale, this prob- 
lem of weighting for chain length would tend 
to be handled automatically. 


Empirical Issues 


A. second large class of issues we group here 
as empirical issues. The distinction from the 
"theoretical" issues listed above is somewhat 
arbitrary. However, above we grouped as theo- 
retical those issues having to do with the 
higher-order postulate and the fundamental 
inspiration of the theory; whereas here, as em- 
pirical issues, we include problems of further 
specification that must get tacked on to the 
core theory by empirical exploration or theo- 
retical inspiration deriving from other view- 
points, rather than from the intrinsic character 
of consistency theory per se. For example, we 
shall discuss: (l) variations in tolerance for 
inconsistency as a function of personality, situ- 
ational, and other factors; (2) alternate modes 
of dissonance reduction and factors affecting 
the preferential ordering among these; (3) fac- 
tors purportedly necessary for arousal of the 
need for consistency, such as awareness, choice, 
commitment, etc.; (4) conjectures regarding the 
acquisition of the need for consistency, having 
to do, eg., with child-rearing practice or later 
role assignment; and (5) time factors involved 
in the operation of consistency need. 

Tolerance of inconsistency. One would ex- 
pect to find reliable individual differences in 
tolerance for inconsistency. Aronson and Fes- 
tinger (1958) did early work in this field. 
Steiner (1954; Steiner and Johnson, 1963) has 
had some success in teasing out personality and 
demographic correlates of tolerance for incon- 
sistency. Bieri (1961) has worked on personality 


correlates, as has Rosen (1961). Sex, intelligence, 
maturity, age, and many personality variables 
have been found (or theorized) to be related 
to tolerance for inconsistency. Others have 
found it more appropriate to look for inter- 
actions than main effects. Steiner and Johnson 
(1963), e.g. find that the individual difference 
variables tend to be related to preferred modes 
of dissonance reduction as well as to general 
tolerance. Rosenberg and Hovland (1960) point 
to interactions beween personality and situ- 
ational (eg. type of issue) variables in deter- 
mining inconsistency tolerance. This important 
individual-differences aspect of consistency the- 
ory is still underinvestigated and underana- 
lyzed. David Glass (in press) has recently sup- 
plied an extensive review of work in this area. 

Also needing further investigation are situ- 
ational determinants of tolerance for incon- 
sistency. There is a complex pattern of 
confirmations and disconfirmations of consis- 
tency-theory predictions running through the 
literature (c, eg, Chapanis and Chapanis, 
1964; Silverman, 1964; McGuire, 1966). There 
seem to be some regularities to the partition 
into positive and negative results, but so far 
these are seen through a glass, darkly. Rather 
than a continuing debate about the general 
validity of consistency theory, we hope for a 
situational analysis of the two well-filled classes 
of studies, to tease out variables that will better 
delimit the proper sphere of applicability of 
consistency theory and help place the funda- 
mental postulates of this theory into their 
proper place among the principles of behavior. 

Alternative modes of inconsistency reduction. 
As we discussed earlier in this chapter, incon- 
sistency assumes multiple components, and 
hence it can be reduced in multiple ways. 
Identifying the likely target of change is a most 
interesting aspect of consistency theory that 
has preoccupied most of us from the start. 
Abelson (1959) has written at length on dif- 
ferent cognitive strategies for restoring balance. 
Festinger and Aronson (1960) have also dis- 
cussed this problem. McGuire (1960c) has 
studied the logical character of propositions 
that make them most likely to change in restor- 
ing consistency. Newcomb has studied the rela- 
tive likelihood of perceptual changes and 
objective relational changes in the strain to- 
ward symmetry (cf. 1961, pp. 254-257). Rosen- 
berg has long studied this problem in such 
terms as the relative likelihood of means and 
ends changes (1956) and of affective versus in- 
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formational changes (1960). Tannenbaum 
(1956) has investigated the probability of 
change as a function of initial polarization 
and in quantifying the “assertion constant.” 
Yet despite (or because of) all this work, the 
issue remains conceptually confused and cries 
for further, less haphazard study with these 
diverse approaches made relevant to one 
another. 

Purportedly necessary conditions for incon- 
sistency arousal. A number of characteristics, 
mostly having to do with the subjective state 
of the subject (sometimes operationally de- 
fined), have been posited as necessary for the 
occurrences of the predicted inconsistency 
effects. “Awareness” has not been made espe- 
cially troublesome. McGuire (1960a) has argued 
that his data imply that awareness of the incon- 
sistency is unnecessary. The “felt discomfort” 
of dissonance theory (Festinger, 1957) appar- 
ently should not be taken to imply a necessary 
phenomenological experience on the part of 
the subject, though Brehm and Cohen, in their 
contrasting of psychoanalytic and dissonance 
theory, are ambivalent on this point (1962, pP. 
167-168). Yet the revival of interest in "aware- 
ness" in connection with studies on verbal con- 
ditioning (Eriksen, 1962) and experimenter bias 
(Rosenthal, 1964) invites a reanalysis in the 
present case: eg, it is necessary to be aware 
at least of the elements themselves, even if not 
of the inconsistency between them, in order 
for the consistency theory predictions to be 
borne out. 

The most investigated of the proposed "nec- 
essary condition" variables is probably that of 
choice or volition. Left rather vague in Fes- 
tinger's original formulation of dissonance the- 
ory (1957), the fait accompli experiments of his 
interpreters, Brehm and Cohen (cf., e.g., 1962, 
Pp. 188-220), purport to show that the subject 
must feel he, to some degree, voluntarily en- 
tered the subsequently dissonant situation, 
though he did not necessarily foresee the dis- 
sonant element in it. The subsequent empirical 
study and theoretical extension of this notion 
has yielded a seemingly contradictory set of 
results. The theoretical and empirical status of 
the choice variable badly needs further analysis. 

Still another such variable whose empirical 
status is ambiguous and whose theoretical in- 
terpretation is debated is that called “commit- 
ment.” It remains uncertain to what extent 
various manipulations under this concept can 
be considered equivalent; viz., active participa- 


tion (King and Janis, 1956), overt compliance 
(Festinger, 1957, pp. 84-122), response restric- 
tion (Kelman, 1953), public versus private com- 
mitment (Bennett-Pelz-v. Bennett, 1955), ex- 
ternal commitment (Rosenbaum and Franc, 
1960), and somewhat more distantly related, 
reactance (J. W. Brehm, 1963). Further exami- 
nation of the theories, the experimental oper- 
ations, and the apparently conflicting results 
should help clarify the processes involved in 
the effects of these overlapping manipulations. 

Origins of the need for consistency. It has 
purportedly been shown (Lawrence and Fes- 
tinger, 1962) that animals other than men oper- 
ate according to consistency theory. On the 
other hand, it has been amply demonstrated in 
the work on alternation and exploratory be- 
havior, effects of infantile stimulation, etc. 
(Berlyne, 1960; Fiske and Maddi, 1961; Fowler, 
1965), that various albinus Rattus norvegicus 
also have the somewhat opposite "need for 
complexity." All of this suggests we might look 
for the acquisition of need consistency in our 
phylogenetic origins, rather than the life his- 
tory of the individual. However, two onto- 
genetic sources of need-consistency have been 
suggested and might repay a closer look: child- 
rearing practices and role assignment. Harvey, 
Hunt, and Schroeder (1961) have analyzed ex- 
tensively how parents' rearing practices and 
the child's own maturational level may interact 
to affect the level of complexity which the 
individual subsequently finds optimal. New- 
comb's role-theory approach suggests that sub- 
sequent life experiences and late status changes 
can influence the person's need for consistency. 

Even cross-cultural data are available on the 
point at issue. Child, Barry, and their co- 
workers have presented a number of studies 
(e.g, Barry, 1957) based on data from the Hu- 
man Relations Area Files showing societal and 
cultural variables related to modal preference 
for openness and complexity as symmetry and 
balance in art (and to a lesser extent, theologi- 
cal systems). Hovland, Lumsdaine, and Sheffield 
(1949, pp. 310-311) have commented on the 
"consistency reaction" one purportedly gets 
with American subjects when using the before- 
after design as contrasted with the "inconsis- 
tency reaction" such a procedure elicits from 
Syrian subjects. 

Temporal factors. It is always captivating to 
deal with predictions involving the time vari- 
able because, as few others encountered in the 
behavioral sciences, this dimension is easily 
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measurable and has well-understood scaling 
properties. Consistency theory has encountered 
this variable in at least two contexts. Leaving 
aside the question of awareness, McGuire 
(19602) has considered to what extent the elici- 
tation of two de facto inconsistent cognitions 
must be temporally contiguous for the need for 
consistency to produce a change in them. Put 
more elegantly, the question is asked: To what 
extent does the pressure toward consistency 
decay as a function of time beween elicitation 
of the inconsistent elements? 

A second temporal issue is raised by the work 
of Elaine Walster (1964; Festinger and Walster, 
1964); it shows that under some conditions the 
postdecision reevaluation towards consistency 
shows a nonmonotonic relationship to time. 
There is an immediate postdecision "regret" 
period with reevaluations away from greater 
consistency with the choice, but a long-term 
trend toward greater consistency. 

These openings of the temporal issue have 
obviously raised more questions than they have 
answered, as has McGuire's (19605) study on 
rate of return to the initial “inconsistent” state 
as time passed from the occasion on which it 
had been made salient. Freedman (1965) also 
has presented interesting data on this relation- 
ship. 


Methodological Issues 


This third rubric under which we group 
issues for discussion is, again, only imperfectly 
distinguishable from the other two types of 
issues (definitional and empirical) considered 
above. We group here, for convenience, those 
problems of manipulating variables or analyz- 
ing data that seem to arise again and again 
in connection with consistency-theory experi- 
ments and in regard to which work in the area 
has been subjected to severe criticism—both 
the self-criticism of the practitioners and the 
rather harsher criticisms of outside observers 
(see McGuire, 1966, for a review). None of the 
problems to be mentioned is found only in the 
consistency-theory area, nor are these the only 
methodological problems in the area, but these 
four arise frequently enough to bear mention. 
They include: (1) analytic and procedural 
problems arising from the availability of mul- 
tiple modes of inconsistency reduction; (2) 
problems of interpretation and replication, 
arising from the tendency toward using rather 
complex social settings and complicated and 
subtle manipulations in many consistency- 


theory experiments; (3) generalization problems 
arising from the frequent use, in consistency- 
theory experiments, of techniques which are 
not themselves well understood, such as role- 
playing, hypnosis, and computer simulation; 
and (4) the problem of obtaining a direct mea- 
sure of inconsistency. We shall mention briefly 
problems in each of these areas. 

Problem of multiple modes of resolution. It 
has already been pointed out that in any in- 
consistency situation there are inevitably alter- 
native ways in which the system can return to 
consistency. Indeed, as was mentioned above, 
the consistency theorists have been quite active 
in positing new ways of reducing inconsistency. 
Problems of method arise when the investigator 
manipulates inconsistency to study the effect 
on one particular mode of inconsistency reduc- 
tion. Strategies for dealing with the likelihood 
that other modes will also be utilized for in- 
consistency reduction have exposed these re- 
searchers to much criticism. The tactic of 
"blocking off" the alternative modes has usually 
resulted in the devising of very complicated 
and artificial situations, which gives rise to 
questions of interpretation and generalization 
(see next section). The alternative tactic of 
closing off these other avenues post factum by 
eliminating subjects who make extensive use 
of other modes has also been severely criticized. 
We need to consider alternative tactics and 
superior methods of computation (e.g. covari- 
ance and multivariate analysis) for handling 
this problem which is intrinsic to research on 
consistency theory and, indeed, to many other 
areas of research. 

The complex manipulation problem. Consis- 
tency-theory researchers are frequently criticized 
also because of a tendency to use extremely 
complicated experimental situations and to 
carry out rather difficult manipulations of vari- 
ables in testing their hypotheses. As a result, 
there arise questions of how the subject per- 
ceived the situation (Rosenberg, 1965), and of 
extraneous variables operating, unappreciated 
by the experimenter, but perhaps more im- 
portant in producing the obtained effect than 
were the variables to which he attributed these 
effects. Numerous studies have been criticized 
as hard to interpret and generalize on these 
grounds. For example, the studies manipulating 
choice avoided subject self-selection by giving 
the subjects a pseudochoice utilizing a compli- 
cated sleight-of-hand induction. In this way 
all the subjects, even those in the "high" choice 
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condition, will make the same choice as the 
subjects in the low choice conditions, but some 
will feel they had more volition in so choosing. 
The complexity and subtlety of the experi- 
mental situations not only make interpretation 
equivocal but also render replication in other 
laboratories difficult, a very serious shortcoming 
for a supposedly scientific theory. Hopefully, 
further discussion and analysis of this problem 
will yield methods of circumventing these com- 
plex subtle inductions or at least will indicate 
to what extent consistency theory predictions 
depend on such elaborate techniques or are 
confirmed even without their employment. 

Questionable techniques. For practical rea- 
sons, consistency theory researchers have made 
more use of methodological techniques that 
are themselves not well understood than have 
workers in most other fields of psychology. For 
example, several investigators have tested pre- 
dictions in a role-playing situation, which al- 
ways leaves it open to question whether the 
subject's responses can be generalized to his 
ordinary behavior, or whether they instead rep- 
resent his conception of what the experimenter 
wants him to do or what a rational man would 
do. Also open to question is the use of hypnosis 
in some consistency studies. The hypnotic pro- 
cesses are themselves more poorly understood 
than those consistency processes that they are 
being used to elucidate (Hilgard, 1965). Still 
others of us have used computer simulation, 
even though we are well aware that this strat- 
egy leaves the theorist with an embarrassing 
number of degrees of freedom. Each of these 
techniques, of course, has its defenders as well 
as its attackers; cf., e.g., R. Brown (1962, p. 74) 
on the value of role playing. We can hardly 
hope in this volume to solve the problems of 
role playing, hypnosis, and computer simulation. 
But we can hope to get an inkling of whether 
a substantial enough body of consistency- 
theory results is confirmed by both these and 
more prosaic techniques so that we can feel 
freer in the future to use these more venture- 
some procedures to study new processes that 
might otherwise remain uninvestigated. 

The criterion problem. The motivational 
state of imbalance, asymmetry, discomfort, dis- 
sonance, etc., postulated by consistency theory 
is an intervening variable. The predictions by 
which the theory is tested typically use as their 
dependent variables quite derivative behavior 
only distantly related to inconsistency and af- 
fected also by many antecedents other than in- 


consistency. As a result, the experiments are 
often insensitive as tests of consistency theory, 
and each individual experiment usually can 
be plausibly explained on bases other than con- 
sistency theory. It would be useful if we had 
more direct measures of this postulated internal 
state. The two obvious roads to making it a 
hypothetical construct would be to give it sur- 
plus meaning in phenomenological or in physi- 
ological terms. In this way we could hope to 
measure inconsistency level more directly by 
means of the subject's verbal reports of his 
felt inconsistency (a course which is unattractive 
for several reasons), or by means of physio- 
logical indexes. A start has been made toward 
developing the latter, e.g, by M. L. Brehm, 
Back, and Bogdonoff (1964) who have used 
blood cholesterol, and by Gerard (1964) who 
has employed electrophysiological indexes. The 
latter has also used simple physical response 
indexes like reaction time. It would be useful 
to examine the possibilities of further develop- 
ing indexes along these lines. 


THE PLACE OF CONSISTENCY THEORY IN 
THE LARGER PSYCHOLOGICAL SCENE 


Our remarks so far may have revealed our 
belief that consistency theory is a Big Thing 
in current psychological thinking and that it 
has in the recent past constituted a very fertile 
approach for those experimenting in social and 
personality psychology. In closing, we should 
like to suggest where consistency theory fits 
within the larger psychological picture—how 
it relates to some other general formulations, 
especially motivation theory and the functional 
approaches. Also, we should like to point out 
that while, at the moment, consistency theory 
may be a big thing, it has not always been so 
and it will again not be so, in the foreseeable 
future. We shall glance ahead and suggest that 
the inevitable corrective to consistency theory 
is already visible on the horizon. 


Relation to Other Approaches 


Consistency among the motives. What is "con- 
sistency” as used in this discussion? It is a state 
of interrelation among cognitions and inter- 
personal bonds. We speak of the individual 
tending toward this state. Hence we view con- 
sistency as a tendency, a drive, a motive. Should 
we then consider consistency as one more mo- 
tive, and seck to answer the questions about 
consistency that are being framed with more 
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and more sophistication (J. S. Brown, 1961; 
Cofer and Appley, 1964) regarding motives in 
general? Should a book called “The Consis- 
tency Motive” take its place on the shelf along 
with “The Achievement Motive" (McClel- 
land, Atkinson, Clark, and Lowell, 1953), 
“The Affiliation Motive” (Schachter, 1959), 
“The Approval Motive” (Crowne and Mar- 
lowe, 1964), etc.? My personal feeling is that 
if this process of finding new motives continues, 
the psychologist with his set of motives will 
offer the intelligent onlooker as absurd a pros- 
pect as the current spectacle of the physicist 
and his ludicrous set of particles. I would feel 
like King Alfonso who, after hearing a descrip- 
tion of the later epicyclic theory of the solar 
system, said, “It may be so, But let me tell 
you, had I been the creator, I could have done 
the thing more simply.” 

Yet while we would be grieved to see con- 
sistency tending raised to the status of a mo- 
tive, like a battle-weary British politician 
elevated to Lords, we agree that the concept 
can perhaps be fitted into current pyschological 
thinking more appropriately as a motive than 
as anything else [not withstanding that long 
ago Summer argued that consistency was only 
a strain found in the realm of social systems 
and not in the heads of individual irrational 
man (cf. 1906, pp. 4-5, 473)]. We feel that clari- 
fication of the concept and heuristic sugges- 
tions for further research will come more from 
the motivational analysis of need for consis- 
tency than from any other single endeavor. 
I am quite hopeful that this advances us to- 
ward a clearer notion of consistency as a mo- 
tive and better formulated research questions 
on the motivational issues than we had at the 
outset. The question has not been neglected 
before. Dissonance theorists have often pre- 
sented derivations that seem in conflict with 
those derived from SR reinforcement theory 
(Aronson, 1961; Aronson, 1963; Lawrence and 
Festinger, 1962; Lott, 1963; Mowrer, 1963) or 
from more traditional hedonic theories (Aron- 
son and Carlsmith, 1962; Festinger and 
Carlsmith, 1959; Janis and Gilmore, 1965; 
Rosenberg, 1965). 

Other past discussions of consistency as a 
motive have included Newcomb's (1953) analy- 
sis of possible advantages of consistency in our 
usual environment. He suggests that whether 
or not it is intrinsically rewarding, consistency 
does pay off and thus would tend in any case to 
have secondary reward value. Several of the 


consistency theorists have questioned whether 
the motivational state involves consistency per 
se. J. W. Brehm and Cohen (1962, pp. 223-231) 
suggest that the evidence indicates that incon- 
sistency is not ipso facto motivating but be- 
comes so only when it frustrates another 
motive. Osgood (1957) suggests that consistency 
is important psychologically only insofar as it 
subserves the need for meaning. 

Consistency among the functions. Several the- 
orists have approached the attitude area from 
a “functional” point of view, which attempts 
to explain and predict the person’s attitudes, 
their interrelations and susceptibility to change, 
and their relation to action, in terms of the 
functions they may serve in his total psycho- 
logical economy. This approach obviously also 
entails an analysis of human motivation. Vari- 
ous workers have employed this approach, with 
the “authoritarian personality” study (Adorno, 
Frenkel-Brunswik, Levinson, and Sanford, 
1950) being the best known of the early work. 
Smith has also been long working in this area 
(Smith, 1949; Smith, Bruner, and White, 1956) 
as has DiVesta (DiVesta and Merwin, 1960; 
Woodruff and DiVesta, 1948). Perhaps the most 
active group has been that at Michigan around 
Katz, Sarnoff, Stotland, et al. (cf. Katz, 1960; 
Katz and Stotland, 1959, for reviews). 

Attitudes have been analyzed as serving many 
functions: utility, expression, meaning, ego, 
defensiveness, etc. It seems likely that main- 
tenance of consistency plays an important role 
in all of these functions; certainly, in the last 
two. We feel that an integration into the func- 
tional approach is another avenue by which 
consistency theories, to the mutual benefit of 
both kinds of approach, could be brought into 
heuristically provocative interaction with other 
approaches. 


The Rise and Fall of Consistency Theory: 
End of the Classic Phase 


We have mentioned that the consistency ap- 
proach was not always so dominant a theme 
in psychological thinking as currently. In fact, 
until the late 1940's it was decidedly out of 
fashion. Perhaps in the early nineteenth cen- 
tury the notion of consistency and the rational 
man reached its height of popularity for the 
Modern Era. The idealism and grand syntheses 
of the German philosophers such as Kant and 
Hegel were exceedingly popular in Nineteenth 
century America from the Mississippi to the 
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Charles and even in the few cultural enclaves 
beyond the old frontier. 

But there was an extreme swing away from 
this rationalism in American psychology in the 
first half of this century. Whether inspired by 
the biologism of Darwin or the preoccupation 
with psychopathology stemming from Freud, it 
was ganz Amerikanisch to view behavior as 
nonrational, fragmented, unintegrated. Perhaps 
one could imagine a Pavlov or a Thorndike 
smiling tolerantly at the consistency theorists; 
but J. B. Watson would have exploded vio- 
lently and Clark Hull would have refused to 
believe they existed, except perhaps as another 
bad joke from the West Coast extremists. 

Still, there is a Yin and Yang in the course 
of scientific progress as in other human pro- 
cesses, and once again the consistency approach 
is highly popular in some circles and at least 
tolerated or recognized as worthy of attack 
(Chapanis and Chapanis, 1964; Lott, 1963; 
Mowrer, 1963) by psychologists in general. It 
seems that on the broad dimension of theoriz- 
ing, like rationality-irrationality, there is a con- 
tinuing slow oscillation between the two poles. 
At one time most of the theorists are gathered 
towards one pole using that notion as a point 
of departure for their thinking about behavior. 
In time, the heuristic potential of that pole 
becomes exhausted and novel predictions and 
nonobvious hypotheses get harder to come by 
As this happens, the natural corrective sets in, 
and the precursors of the new establishment 
swing to the opposite pole to continue the up- 
ward and onward sweep of psychological 
knowledge from that new vantage point. 

It is of some parenthetical interest to me 
that this swing back and forth does not seem 
to be correlated across disciplines. Take psy- 
chology and philosophy. The 1925-1950 period 
was one in which the dominant psychological 
interest and approach involved the nonrational 
nonintegrated themes of isolated habits, non- 
phenomenal, noncognitivistic constructs, auto- 
matic and compartmentalized behaviors, etc. 
At the same time in philosophy, we witnessed 
unusual interest in systematics, symbolic logic, 
logical empiricism, the study of systematic the- 
ory, etc. Since 1950 psychology has been shifting 
toward an interest in integration and rational- 
ism; we see the opposite shift in philosophy, 
with the dominant theme there of irrational 
man (Barrett, 1958). The voices we hear from 
the philosophical circles are no longer Hegel 
or Kant, or even the Wienerkreis but rather 


those of Kierkegaard, Husserl, Heidegger, 
Marcel, Merleau-Ponty (my own favorite), 
Jaspers, Sartre, etc. It appears that while philos- 
ophy yins, psychology yangs. One field's classic 
period tends to be another's romantic period. 

Where are we today in this oscillation be- 
tween the romantic and classic? I personally 
would venture the guess that we are at or near 
the quantitative peak of interest for this swing 
of the pendulum, but I suspect research activity 
will be sustained at the present level for at 
least five more years. I do see, however, the 
inevitable correction already setting in. This 
antithetical trend I shall call "complexity 
theory." 

It is an interesting phenomenon that while 
many bright young men have been drawing 
heuristic inspiration from the consistency ap- 
proach another group of men, a little younger, 
not so numerous, but perhaps just as bright, 
have been drawing their inspiration from the 
rather opposite complexity approach which, if 
not logically contradictory to the consistency 
theory, is at least the reverse of it in psycholog- 
ical flavor. 

Under the unsatisfactory rubric of "complex- 
ity theories" I include the diverse approaches 
reviewed in such recent books as Berlyne 
(1960), Fiske and Maddi (1961), and Fowler 
(1965). Here we come upon a romantic notion 
of the organism, very different from the classic 
notion of consistency theories. The latter classic 
organism has a penchant for stability, redun- 
dancy, familiarity, confirmation of expectance, 
avoidance of the new, the unpredictable. Com- 
plexity theory's romantic organism works on a 
quite different economy. It has a stimulus hun- 
ger, and exploratory drive, a need curiosity. 
It takes pleasure in the unexpected, at least in 
intermediate levels of unpredictability. It 
wants to experience everything; it shows alter- 
nation behavior; it finds novelty rewarding. 

It is an interesting incident for the sociolo- 
gists of science that these two rather opposed 
themes of consistency need and complexity 
need have coexisted in the same field with very 
little hard feelings and, unfortunately, very 
little confrontation. Like that earlier-mentioned 
contributor to the Sikh scripture, I feel that 
even when the two approaches are contradic- 
tory, both are true. The resolution is en- 
capsulated in a hypothetical non-monotonic 
relationship between our dependent and inde- 
pendent variables with the inflection point for 
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the behavior lying at an intermediate level of consistency theorists. However, the chapter is 


our consistency manipulation. already long, and the claims of justice must 
I am tempted to go further into the work yield to the weakness of the flesh and so we 
of the complexity theories, presenting their re- must here bring these extended remarks to a 
search in as much detail as I have given to the close. 
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46. Cognitive Dynamics in the Conduct of Human Affairs 


CHARLES E. OSGOOD 


Over the past two decades a great deal of social- 
psychological research has been converging on 
a conclusion about human thinking that com- 
mon sense had already isolated as the consis- 
tency which is the “hobgoblin of little minds.” 
It appears, however, that "consistency" can 
plague big minds as well as little, in high 
places as well as low. Indeed, the difficulties 
we face today on both national and interna- 
tional levels can be traced, in part at least, to 
these dynamics of human thinking. Research 
that is relevant to our problem cuts a wide 
swathe through the social sciences—attitude 
formation and change; the effects of context 
upon the interpretation of both perceptual and 
linguistic signs; interpersonal perception and 
group dynamics; the interactions among beliefs, 
decisions, and social behavior; and even public 
affairs. The researchers have come from a di- 
versity of theoretical molds. Accordingly, the 
purposes of this paper are to provide a brief 
purview of this research on cognitive dynamics, 
to indicate the essential similarities in the theo- 
retical notions that have been proposed, and 
to point up the significance of such cognitive 
dynamics for contemporary human affairs. But 
first we need a few specimens of the phenom- 
ena we wish to study. 

Specimen 1: International affairs. Before the 
delegates to the United Nations Khrushchev 
makes sweeping proposals for world disarma- 
ment. A large segment of the American press 
editorializes about the deceptive nature of these 
proposals, that, rather than sincere overtures 
toward peaceful solutions of problems, his pro- 
posals are carefully planned moves in the Cold 
War. It is cognitively inconsistent for us to 
think of people we dislike and distrust making 
honest, conciliatory moves, behaving as human 


* Reprinted from Public Opinion Quarterly, 1960, 
24, 341-365, with permission of the author and the 
publisher, 
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beings ought to behave, and assuming noble 
postures. 

Specimen 2: Internal affairs. A noved counter- 
spy is invited to speak in the high school audi- 
torium of a university town. In the course of 
his talk, he emphasizes the fact that the univer- 
sity hasn't invited any anti-Communists (includ- 
ing himself) to make public appearances in its 
halls. Although no allegations are directly made, 
many people in his audience are led to con- 
clude (a) that the university has invited pro- 
Communists (which it hasn't), or at least (b) 
that the university must include some powerful 
Communist supporters. 

Specimen 3: Individual behavior and belief. 
Some time after stories in the mass media about 
the relation between smoking and lung cancer 
had saturated the public, a survey in Minneap- 
olis inquired about both the smoking habits of 
respondents and whether they thought the rela- 
tionship between smoking and lung cancer had 
been proven or not proven. The results showed 
that 29 per cent of nonsmokers, 20 per cent of 
light smokers, but only 7 per cent of heavy 
smokers believed it had been proven. It is cog- 
nitively inconsistent to believe one way and be- 
have another; people who smoke heavily find it 
easier to disbelieve information that it is dam- 
aging to their health.1 ; 

Specimen 4: Interpersonal affairs. Fraternity 
men were asked to (a) name the men in their 
group they liked best and liked least, (b) rate 
themselves on a series of traits, and (c) rate the 
other men on the same series of traits. The re- 
sults showed that these men assumed greater 
similarity in personality traits between people 
they liked and themselves than actually existed. 


1 Leon Festinger, A Theory of Cognitive Disso- 
nance, New York, Row, Peterson, 1957, p. 155. 

2 F. E. Fiedler, W. G. Warrington, and F. J. Blais- 
dell, “Unconscious Attitudes as Correlates of Socio- 
metric Choice in a Social Group,” Journal of 


It has also been shown that husbands and wives 
attribute more similarity between them than 
actually exists, and this is more true for happily 
married than for unhappily married couples? 
It is "natural" to assume that people we like 
must think and feel as we do—at least, to the 
extent that we like ourselves. 

Specimen 5: Making inferences about people. 
If we observe, or are told, that so-and-so is in- 
telligent and considerate, and this is all the in- 
formation we have, we are nevertheless able to 
generate many inferences about him—he is also 
likely to be sensitive, socially adept, alert, and 
so forth, we assume. The traits we infer are not 
haphazard: they are generated from the region 
of intersection of the meanings of the traits we 
know about, according to laws of cognitive in- 
teraction. Many of the predictions we make 
about people and the expectations we have of 
them are based on inferences of this type. 

Specimen 6: Perceptual affairs. In his Film 
Technique and Film Acting? Pudovkin de- 
scribes a little experiment in film editing. A 
simple, passive close-up of the well-known Rus- 
sian actor, Mosjukhin, was joined to three dif- 
ferent strips of film. In one this close-up was 
followed by a shot of a bowl of soup on the 
table; in another it was followed by shots show- 
ing a dead woman in a coffin; in the third it 
was followed by shots of a little girl playing 
with a funny toy bear. The effects on an unsus- 
pecting audience were terrific, according to Pu- 
dovkin. “The public raved about the acting of 
the artist. They pointed out the heavy pensive- 
ness of his mood over the forgotten soup, were 
touched and moved by the deep sorrow with 
which he looked at the dead woman, and ad- 
mired the light, happy smile with which he sur- 
veyed the girl at play. But we knew that in all 
three cases the face was exactly the same.” 

Specimen 7: A matter of naming. “A rose by 
any name would smell as sweet,” we have been 
told—but would it? I do not know of any ex- 
periments on the influence of labels upon per- 


Abnormal and Social Psychology, Vol. 47, 1952, 
Pp. 790-796. 

3 M. G. Preston, W. L. Peltz, E. H. Mudd, and 
H. B. Froscher, "Impressions of Personality as a 
Function of Marital Conflict," Journal of Abnormal 
and Social Psychology, Vol. 47, 1952, pp. 326-336. 
4 J. S. Bruner and R. Tagiuri, “The Perception of 
People,” in G. Lindzey, editor, Handbook of Social 
Psychology, Vol. 2, Cambridge, Mass., Addison- 
Wesley, 1954. 

5 V. I. Pudovkin, Film Technique and Film Acting, 
London, Vision, 1954. 
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ception of odors, but this should be easily 
demonstrated, smells being the elusive, subjec- 
tive business they are. But from the myth of 
suburbia we can gather many examples of the 
same sort—the cramped, standardized, insignifi- 
cant little house with its postagestamp yard in 
“Briarwood Valley," in “Larchmont Hills,” or in 
"Sunnyvale Downs" somehow assumes a splen- 
dor and grace it could never have in “South- 
side Brighton Avenue, subdivision No. 7." The 
cheap pancl.and-paste bedroom set, produced 
by the hundreds of thousands but garnished 
with the name “Beverley Charm by Rudet,” 
acquires a distinction far beyond its cost and 
worth. 4 
Specimen 8: Attitudinal affairs. Suppose that 
we are favorably disposed toward Eisenhower, 
both as a person and as the President of our 
country. In Uruguay, let us say, he is greeted 
with flowers and smiles, but in Paraguay an 
unruly mob of students boos him and has to 
be dispersed with tear gas. Having little infor- 
mation, and generally neutral attitudes, toward 
both Uruguay and Paraguay, we find ourselves 
considerably more favorable toward the former 
than toward the latter. Subsequent news that 
Uruguay lives under a harsh dictatorship will 
be discounted—it must really be “benevolent” 
—and the fact that Paraguay has a democratic 
form of government, much like our own, is 
somehow difficult to assimilate. As this hypo- 
thetical example shows, we strive to maintain 
internal consistency among our attitudes and 
beliefs, often at the price of doctoring reality. 
So much for specimens of cognitive interac- 
tion in human affairs. Such examples could be 
elaborated ad infinitum. The important thing 
is that they are all cut from the same cloth; 
they are all instances of a basic dynamism ac- 
cording to which human judgment, belief, per- 
ception, and thought are transformed in 
midflight, so to speak. 


SOME THEORY OF COGNITIVE 
INTERACTION 


Insight into the dynamics of human think- 
ing has been available in the writings of bril- 
liant men of all periods. Certainly Aristotle was 
aware of these dynamics when he dealt with the 
principles of rhetoric; Shakespeare imposes the 
rules of psycho-logic® on the thought and be- 


6 This term has been used by R. P. Abelson and 
M. J. Rosenberg in their article, “Symbolic Psycho- 
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havior of his characters; and Machiavelli could 
not have had the understanding he did of poli- 
tics without an intuitive grasp of the same rules. 
But intuitive grasp and common sense—essen- 
tial though they may be to discovery in science 
—are not the same thing as explicit and testable 
principles of human behavior. 

Among psychologists who have dealt with cog- 
nitive interaction in recent times, Fritz Heider 
undoubtedly has given the earliest and richest 
analysis, in his two papers in the middle forties? 
and particularly as elaborated in his new book, 
The Psychology of Interpersonal Relations,§ 
which is a much broader study of human per- 
ception and thinking than the title implies. 
Working in the area of human communication, 
particularly in small groups, Theodore New- 
comb has utilized very similar theoretical 
notions to Heider's.? In his A Theory of Cog- 
nitive Dissonance, Leon Festinger has probably 
given the clearest statement of this type of 
theory and, through his own ingenious experi- 
ments, has extended it into the whole area of 
relations between cognitions and overt be- 
havior.!? Osgood and Tannenbaum, working in 
the area of attitude change, have presented what 
they call “the congruity hypothesis"—which 
again has similar features to the Heider-New- 
comb-Festinger approaches.!! The most explicit 
statement of this hypothesis appears in The 
Measurement of Meaning in the context of 
Osgood's mediation theory of meaning and the 
measurement procedures of the semantic differ- 
ential!? Rather than try to describe each of 
these theories in isolation, it will be more useful 
to describe them comparatively in terms of cer- 
tain common and differential features. 


logic: A Model of Attitudinal Cognition," Behav- 
ioral Science, Vol. 3, 1958, pp. 1-13. 

7 F. Heider, “Social Perception and Phenomenal 
Causality,” Psychological Review, Vol. 51, 1944, 
pp. 358-374. F. Heider, “Attitudes and Cognitive 
Organization,” Journal of Psychology, Vol. 21, 1946, 
pp. 107-112, 

8 F. Heider, The Psychology of Interpersonal Re- 
lations, New York, Wiley, 1958. 

® T. M. Newcomb, "An Approach to the Study of 
Communicative Acts," Psychological Review, Vol. 
60, 1953, pp. 393-404. 

10 Festinger, op. cit. 

n C, E. and P. H. Tannenbaum, “The 
Principle of Congruity in the Prediction of Atti- 
tude Change," Psychological Review, Vol. 62, 1955, 
pp. 42-55. 

12 C. E. Osgood, G. J. Suci, and P. H. Tannenbaum, 
The Measurement of Meaning, Urbana, Il, Uni- 
versity of Illinois Press, 1957. 


l. Cognitive Modification Results from the 
Psychological Stress Produced by Cognitive 
Inconsistencies 


We have here a kind of motivation, analogous 
to other drive states like hunger, sex, and anx- 
iety, but purely cognitive in origin. It is neces- 
sary, of course, to define the states of cognitive 
"consistency" and "inconsistency" and in terms 
as close as possible to observables. Heider him- 
self speaks in terms of balance and imbalance 
(from within a gestalt framework), but he does 
not provide us with a very clear statement, be- 
yond the fact that “a balanced state is... a 
situation in which the relations among the en- 
tities fit together harmoniously; there is no stress 
toward change.”13 In order to give meaning and 
order to the flux of distal stimuli (things, per- 
sons, events), the individual strives to maintain 
balance among the proximal signs (cognitions) 
of these external affairs.14 

Newcomb sees human communication as a 
means of achieving or maintaining symmetry in 
the orientations of individuals with respect to 
objects or events.15 Festinger's theory is ex- 
pressed in terms of consonance and dissonance. 
These terms refer to the relations which may 
exist between pairs of cognitive elements (bits 
of knowledge about the world, other people, the 
self, one's own behavior).16 


13 Heider, The Psychology of Interpersonal Rela- 
tions, p. 201. 

14 Thus, if person P likes another person O, and 
desires a certain object X, it is fitting (balanced) 
that he strive to be with the other person and ac- 
quire the object, that he assume that the other 
person also likes him and that the object is of 
value intrinsically, that he assume the other person 
also likes the object he finds desirable, and so forth. 
States of imbalance, e.g., when P discovers that an 
admired O does not like X the way he does, pro- 
duce stress and tension; it is these stresses that 
generate cognitive dynamics. 

15 In Newcomb's system the minimal communica- 
tive act is that where A communicates to B about 
some event, X. The important variables are A's 
orientation (attitude and meaning) toward X, A's 
orientation toward B, B's orientation toward X, and 
B's orientation toward A, each such orientation 
conceived as being potentially positive or negative 
in sígn. To the extent that A's behavior with re- 
spect to X depends upon B's orientations, and to 
the extent nonsymmetrical relations exist, there 
will be stress toward communicating about X, on 
the one hand, and toward increasing or decreasing 
the cohesiveness of the group relation between A 
and B, on the other. 

16 Cognitive elements are consonant when one im- 
plies the other logically (e.g. knowledge that it is 
raining out and knowledge that you are putting on 
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Where Heider has not attempted a formal 
definition of balance in his system, Cartwright 
and Harary have done so in terms of the mathe- 
matical theory of linear graphs.17 To handle 
Heider's types of situation in graph theory 
requires, according to these authors, signed, di- 
rected graphs of type 2. We start with the cog- 
nitive elements (people, objects, etc.) defined as 
points. If these elements are involved in inter- 
actions (and they need not be, cf. section 2 
below), then we connect them with a line; the 
line must be directed by means of an arrow in 
order to take into account Heider's distinction 
between agent and recipient (e.g., in Figure 14, 
it is P who likes O, the reverse relation not 
being given). Since Heider talks about both 
sentiments (liking vs. disliking) and cognitive 
units (belonging vs. not belonging) as being 
two-valued, we also require a signed line— 
which Cartwright and Harary accomplish by 
means of solid vs. dashed lines. Finally, since 
Heider distinguishes the two types of relations, 
"liking" vs. "belonging," it is suggested that a 
type 2 graph, using two colors, for example, 
should be employed. However, Cartwright and 
Harary make little use of this last distinction 
and, in fact, criticize the ambiguity in Heider's 
"cognitive unit" conception (see section 2 be- 
low).18 In a more recent paper Harary has fur- 
ther elaborated this type of analysis, with special 
reference to the measurement of structural 
balance in small groups.!9 


your raincoat), dissonant when the obverse of one 
would follow from the other (e.g, knowledge that 
it is raining and planning to go on a picnic), and 
irrelevant when one implies nothing about the 
other (e.g., knowledge that it is raining and knowl- 
edge that you were married in 1939). Although 
analyzed in pairs, cognitive elements are assumed 
to exist in clusters, presumably on the basis of 
either external patterning of signs in the environ- 
ment or internal patterning of meaningful associa- 
tions. Dissonant elements in such clusters create the 
stress toward cognitive modification. 

17 D. Cartwright and F. Harary, "Structural Bal- 
ance: A Generalization of Heider's Theory," Psy- 
chological Review, Vol. 63, 1956, pp. 277-298. 

18 A number of deductions about balanced states 
are derived from this model: (1) An S-graph is bal- 
anced if and only if all paths joining the same pair 
of points have the same sign. In Figure 14 the 
relation between M and N is balanced; it would 
not be if M liked N, but not vice versa. (2) Any 
Cycle within an S-graph is balanced if it contains 
an even number of negatively signed lines (0, 2, 4, 
etc.); otherwise it is unbalanced. A cycle is any re- 
cursive path through any set of points connected 
by lines. The M-N-X cycle in Figure 1A is bal- 
anced; the P-O-X cycle is not. (3) A structure 


This brings up a source of confusion in this 
field that has hardly been recognized, as far as 
I am aware, but must be cleared up before we 
proceed. Cognitive interactions, obviously, tran- 
spire within the nervous systems of single indi- 
viduals. The “maps” we draw to represent such 
interactions necessarily reflect person-person and 
person-object relations as some individual per- 
ceives them. Festinger is clearly aware of this; 
Heider also seems to be working on this basis— 
to the extent that his book might better have 
been titled “The Psychology of Interpersonal 
Perceptions.” But both Newcomb and Cart- 
wright and Harary seem to shift too easily from 
the subjective (cognitive interactions in indi- 
viduals) to the objective (group structure and 
dynamics) frame of reference. Now it may be 
that the laws which apply to the interactions, 
stresses, and resolutions among the cognitive 
processes of individuals can be directly trans- 
ferred to interactions within groups of people 
— where persons are the elements rather than 
cognitions—but this remains to be proven. The 
bridge between the two levels presumably lies 
in the fact that the structuring of a group de- 
pends upon the cognitive "maps" individual 
members have of it. 

Osgood et al.20 equate "cognitive elements" 
with the meanings of signs, and these are in- 
dexed in terms of n bipolar dimensions or fac- 
tors. However, since interactions are assumed to 
occur on each dimension independently of the 
others, we may restrict our attention to the dom- 
inant evaluative factor of the meaning space— 
which is the one which has interested all other 
investigators. In the measurement system pro- 
vided by the semantic differential, the evalua- 
tive factor runs from +3 (extremely good) 
through 0 (neutral) to —3 (extremely bad). The 
evaluative meaning of a concept (cognitive ele- 
ment) is its location along such a scale; the 
polarization of a concept is its distance from 
0, regardless of sign. Now, it is assumed that 
evaluative meanings are mediated by a repre- 
sentational reaction system (perhaps here the 
autonomic nervous system and its connections 
with the central nervous system) which can only 


theorem states that an S-graph is balanced if and 
only if its points can be separated into two mutu- 
ally exclusive subsets such that cach positive line 
joins two points of the same subset and each nega- 
tive line joins points from different subsets. 

19 F. Harary, "On the Measurement of Structural 
Balance," Behavioral Science, Vol. 4, 1959, pp. 316- 
323. 

20 Osgood, Suci, and Tannenbaum, of. cit. 
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do one thing, assume one “posture,” at one 
time. It must follow, therefore, that if two (or 
more) signs associated with different evaluative 
meanings occur near-simultaneously, only one 
cognitive reaction can occur in the system, and 
this must be a compromise. According to Os- 
good et al., congruity exists when the evaluative 
meanings of interacting signs are equally polar- 
ized or intense—either in the same or opposite 
evaluative directions (see section 2 on types of 
assertions). To the extent that there are differ- 
ences in polarization, some degree of incongru- 
ity must exist to be resolved in the process of 
cognizing these signs. 

Although this theory will be shown to lead to 
similar conclusions about human thinking, it 
developed from a very different conceptual 
background than the others we have been con- 
sidering, and certain critical differences should 
be noted at this point. First, it attributes de- 
grees of incongruity to single pairs of elements 
rather than the all-or-nothing relations found 
in Heider and Festinger. Coupled with a mea- 
suring device like the semantic differential, this 
can lead to more refined predictions. Second, 
it assigns affective or attitudinal values to the 
cognitive elements themselves, and not to their 
relations, whereas Heider, at least, assigns both 
affective and connecting properties to the rela- 
tions between cognitive elements. This double 
function of the relational variable is, to my 
mind, the major weakness in Heider's theory. 

This distinction becomes clear in connection 
with the signed digraphs of Cartwright and 
Harary. The positive sign of the line P likes O 
in Figure 14 says nothing about the evaluations 
of either P or O per se—a bowery bum may be 
perceived as liking a slut by the particular Ego 
for whom our map is constructed! Now, giving 
the elements values may be unnecessary when 
dealing with group structures (where the points 


represent people), but when we are dealing with 
cognitive interactions (and the points represent 
concepts) we know that the elements have vary- 
ing values. When I see P dancing with O, and 
O is my girl or wife, the impact of this cogni- 
tion obviously depends upon my attitude to- 
ward P (to say nothing of my attitude toward 
O), quite apart from the cognitive implications 
of dancing with. 

Figure 1B suggests an alteration of the Cart- 
wright-Harary graphic technique that may take 
account of this added property of cognitive 
maps. Let us enlarge each point to a circle and 
in it place a sign corresponding to the evalua- 
tive meaning of the cognitive element repre- 
sented, +, 0 or —. Then, since we will later 
wish to use dashed lines to represent inferred 
relations (not given in the verbal or behavioral 
data), let us use only solid lines for the relations 
between elements and sign them by means of 
+ or —. The basic test for balance (or con- 
gruity) of a cognitive structure must now be 
changed: A cognitive structure is congruent if, 
and only if, all its included assertions have 
either zero or two negative signs—where an 
“assertion” is any pair of cognitive elements 
connected by a line. Applying this test to the 
graph in Figure 1B, we can see that in this cog- 
nitive map the only assertion that fails to be 
congruent is that where +O is in favor of —X. 
There may be other, more complex, laws of bal- 
ance or congruence derivable from this model, 
but I have not yet explored this possibility. 

Is this elaboration of the Cartwright-Harary 
model psychologically necessary? The following 
seems to be a crucial test: Imagine a three- 
element triangular system in which all three 
pathways are signed +. Such a cycle would have 
to be cognitively balanced in the Cartwright- 
Harary system (as well as for Heider). It would 
also have to be congruent in my system, pro- 


Figure 1 
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vided that the three elements were either all 
positive or all negative (e.g., "I love Mommy, 
Mommy loves Daddy, Daddy loves Me"). But 
what if one of the elements is negative? Here 
we would have two "good" people, both liking 
each other, but also liking an evil person, ob- 
ject or act (e.g, I love Mommy, Mommy loves 
a murderer, I love a murderer, etc.). If we agree 
that it is psychologically incongruent for a 
"good" thing to be positively associated with a 
"bad" thing, then the Cartwright-Harary model 
is untenable without elaboration. 


2. If Cognitive Elements Are to Interact, They 
Must Be Brought into Some Relation with One 
Another 


Contiguity is a necessary, but not a sufficient, 
condition for interaction among cognitive ele- 
ments. In "Tom is a thief; Paul will catch him," 
"Paul" is spatially and temporally closer to 
"thief" than is “Tom,” but the structure of the 
sentence brings "Tom" and "thief" into inter- 
action. We have varying attitudes toward myriad 
people, things, and events, many of them po- 
tentially incongruent, imbalanced, or dissonant 
as one’s theory would have it, but these cogni- 
tions are not continuously interacting—only 
when they are brought together in some way. 

Festinger’s theory fails to give explicit recog- 
nition to this variable. Implicit recognition of 
the need for linkage in some unit appears in 
the design of his experiments, however—disso- 
nance only occurs when a person has been 
forced to make a choice between two gifts, 
when he has been exposed to information con- 
sistent or inconsistent with his beliefs, and so 
on. Analysis into "relevant" and "irrelevant" 
pairs of cognitive elements is not sufficient; cog- 
nitions of the attractiveness of an electric 
toaster and an electric clock are always poten- 
tially relevant, but only become effectively rel- 
evant when, by forced choice, the implied 
negative assertion that one is better than the 
other is made. 

Heider, on the other hand, does give explicit 
recognition to this variable,?! treating it in 
terms of gestalt perceptual factors. "Separate 
entities comprise a unit when they are perceived 
as belonging together. For example, members 
of a family are seen as a unit; a person and his 
deed belong together." Such factors as similar- 
ity, proximity, common fate, good continuation, 
set and past experience are cited as contribut- 


71 Heider, The Psychology of Interpersonal Rela- 
tions, pp. 176%. 


ing to the formation of cognitive units, but it 
is evident in his examples that much more than 
perceptual organization in the traditional sense 
is involved. The difficulty with Heider's analysis, 
as Cartwright and Harary have implied, is that 
belonging in a unit (U or not U) is given the 
same status as liking (L vs. DL), both as rela- 
tions between cognitive elements. Thus the 
triad, P worships O (P L O), O told a lie (O 
U X), and P disapproves of lying (P DL X), is 
said to be unbalanced because it has only one 
negative relation. One could also say that since 
O is + in evaluation and X is — in evaluation 
for P, the single assertion that O told a lie (i.e., 
+O + —X) is itself cognitively unbalanced or 
dissonant for P. 

Osgood et al. make an absolute distinction 
between structure and content in the represen- 
tation and analysis of cognitive interactions. In 
order for two cognitive elements to interact, 
they must be related in some kind of assertion. 
Assertions may be linguistic (‘Eisenhower fa- 
vors Big Business,” as read in an editorial) or 
behavioral (a picture of Eleanor Roosevelt pat- 
ting the head of a little colored boy), and they 
may be either associative (X favors, likes, owns, 
is a member of, etc., Y) or dissociative (X at- 
tacks, dislikes, throws away, is excluded from, 
etc., Y). But whether a particular cognitive pat- 
tern is congruent or not depends on both the 
structure and the content. Thus, in contradis- 
tinction to Heider, the assertion P likes O 
merely indicates an associative or positive struc- 
ture; if both P and O have the same sign evalu- 
atively, the assertion is congruent (e.g., “God is 
on our side” and “The Devil aids the enemy” 
are both congruent assertions), but if P is + 
and O —, or vice versa, then P likes O becomes 
incongruent (e.g, "God is with the enemy"). 

Although it is significant that Heider came 
to his theory via a very penetrating study of the 
ordinary language of human relations—which 
he calls a "naive psychology"—it does not seem 
to me that he has fully explored the possibilities 
in rigorous linguistic (structural) analysis. Os- 
good, Saporta, and Nunnally?? have developed 
a technique for abstracting the affective content 
of messages, called evaluative assertion analysis: 
(1) Objects of attitude are isolated from com- 
mon-meaning terms and then masked by substi- 
tuting nonsense letter pairs like AZ, BY, and 
CW for them. (2) Complex utterances in the 
22 C. E. Osgood, S. Saporta, and J. C. Nunnally, 


“Evaluative Assertion Analysis," Litera, Vol. 8, 1956, 
pp. 47-102. 
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masked message are broken down into compo- 
nent assertions of the actor-action-complement 
form. (3) Evaluative weights are given to the 
symbols for attitude objects in terms of their 
structural relations to evaluative common-mean- 
ing terms and other attitude objects. (4) The 
attitudinal consistency of the original utterances 
is checked by applying the congruity test (an 
even number of negative signs in each assertion). 

Take, for example, the masked sentence, “AZ 
attacks the expansionist ambitions of both BY 
and CX.” This breaks into three component as- 
sertions: /AZ/attacks/expansionist ambitions], 
/BY/has/expansionist ambitions/, and /CX/ 
has/expansionist ambitions/. The circles and 
solid lines in Figure 2 provides a signed digraph 


Figure 2 


of the structural relations actually given. How- 
ever, if one is also given the valence of the com- 
mon-meaning cognitive element, “expansionist 
ambitions,” he can fill in by the rules of infer- 
ence of psycho-logic what the valences of all of 
the attitude-object circles must be: AZ must be 
+, BY must be —, and CX must be —, other- 
wise we would have an assertion line with an 
odd number of negatives.23 But note that we 
still cannot decide whether or not this is a con- 
gruent or incongruent pattern until we know 
that, in fact and for some particular language 
user, AZ is + in evaluation, BY is —, and so 
forth. If AZ (e.g., Franco Spain) is in fact some- 
what negative for a person, then it is incongru- 
ous psycho-logically for it to be “attacking” 
something (e.g, the “expansionist ambitions") 
belonging to two bad things (e.g., Russia and 
the Arab League), and he is under some pres- 
sure to modify his cognitive map. 

The process of inference through psycho- 
logic—hence the possibility of predicting the 
23 It should be pointed out that not all syntactical 
structures allow such inference of concept valences. 


effects of implicit assertions in linguistic and 
behavioral situations—is also illustrated in Fig- 
ure 2 by the dashed lines. Given that AZ is +, 
BY is —, and CX is — (i.e, a balanced situa- 
tion), even though nothing is said about other 
relations, we can predict (1) that AZ should be 
against BY and vice versa, (2) that AZ should 
be against CX and vice versa, and (3) that BY 
should be in favor of CX and vice versa. Much 
of what is communicated attitudinally by mes- 
sages and by behavior is based on such infer- 
ences; that is, of course, the chief tool of the 
propagandist, the technique of "innuendo." 
'The syntax of language and of behavior pro- 
vides a structural framework within which 
meaningful contents are put; the structure indi- 
cates what is related to what, and how, but only 
when the meaningful values are added does the 
combination of structure and content determine 
psycho-logical congruence or incongruence. 

It was Abelson and Rosenberg?! who intro- 
duced the term “psycho-logic” and contrasted it 
with “logic.” They give a number of the rules 
of psycho-logic rule I—A likes B and B likes C 
implies that A likes C also; rule 2—A likes B 
and B dislikes C implies that A dislikes C; rule 
3—A dislikes B and B dislikes C implies that 
A likes C, and so forth. It is important to point 
out that the inferences from psycho-logic are 
not necessarily invalid; they are simply illogical. 
For example, if Khrushchev were to indicate 
that he favors a particular American presiden- 
tial hopeful—the so-called "kiss of death"— 
many people would, psycho-logically, become 
suspicious of this candidate. Now, Khrushchev's 
support might be valid grounds for suspicion, 
but the inference is not logically necessary. 

Abelson and Rosenberg also make an asser- 
tional analysis of linguistic statements, very 
much as Osgood, Saporta, and Nunnally have 
done, and they extend this type of analysis to 
interlocking sets of statements on the same 
topic. If the connectives (positive, p, and nega- 
tive, n) in the following assertions—" Im for 
having coeds at Yale," “J want good grades,” 
and “Having coeds would undoubtedly inter- 
fere with getting good grades""—are arranged in 
a matrix in this fashion, 


Good 
Ego Coeds Grades 
Ego p p p 
Coeds p p n 
Good grades p n p 


24 Abelson and Rosenberg, op. cit. 
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the degree and locus of incongruity can be 
clearly indicated. Since we cannot change the 
signs of any corresponding row and column in 
the matrix (always leaving the diagonal entries 
signed p) and thereby reduce the number of n's 
in one half of the matrix, the minimum com- 
plexity or imbalance is 1. If the first statement 
were changed to “I am against having coeds at 
Yale," similar analysis shows that all entries 
could be changed to the same sign, and hence 
the matrix of statements would then be con- 
gruent. This approach is similar to that of 
Cartwright and Harary, but is based on matrix 
algebra rather than the mathematical theory of 
graphs. 


3. Magnitude of Stress Toward Modification 
Increases with the Degree of Cognitive 
Inconsistency 


Most of the theories we have been consider- 
ing express this relation, but it only becomes 
useful when a theory permits quantification. 
Heiders own statements remain essentially 
qualitative. Newcomb, on the other hand, states 
his principles in quantitative form, eg., “the 
stronger the forces toward A's co-orientation in 
respect to B and X, ... the greater A's strain 
toward symmetry with B in respect to X . . . ,"25 
but the units in which "forces toward co-orien- 
tation" and "strain toward symmetry" are to 
be measured remain obscure. Festinger specifies 
that the magnitude of dissonance increases (1) 
with the importance of the dissonant elements 
and (2) with the weighted proportion of all 
elements in a cognitive cluster that are disso- 
nant. But again it is not clear how "importance" 
or "weighted proportion of elements that are 
dissonant" are to be quantified. 

Association of the Osgood and Tannenbaum 
congruity hypothesis with the semantic differ- 
ential provides quantification of cognitive 
inconsistency, but it also limits the types of situ- 
ation that can be studied. Total pressure to- 
ward congruity, P, is stated to be equal to the 
difference between the initial scale position of 
a cognitive element (prior to interaction) and 
its location of perfect congruence (under the 
conditions of interaction). Perfect congruence 
between two elements was defined earlier as 
equal polarization (intensity), either in the same 
evaluative direction (for associative assertions) 
or in opposite evaluative directions (for dis- 
sociative assertions). Thus, for a hypothetical 


?5 Newcomb, of. cit. 


individual for whom Eisenhower is +3 and 
Uruguay is +1, the assertion “Eisenhower was 
greeted with flowers in Uruguay" would be con- 
gruent if Eisenhower were only 4-1 or Uruguay 
were +3. We therefore have 2 units of pressure 
toward congruity, P, in this situation—and it 
may be noted in passing that the P's associated 
with the various interacting elements are always 
equal according to this hypothesis. 

Cartwright and Harary arrive at the degree 
of balance of an S-digraph by taking the ratio 
of the number of positive semicycles to the 
total number of semicycles (cf., above). They 
state that such an index makes it possible to 
deal with cognitive inconsistencies in probabil- 
istic and statistical fashions; they do not state 
any relation between degree of balance and the 
total stress toward consistency, however. Abel- 
son and Rosenberg define the complexity of 
imbalance as the minimum number of changes 
of relations necessary to achieve balance. In the 
example given earlier about having coeds at 
Yale, the sign of one of the three assertions 
would have to be changed to achieve psycho- 
logical consistency. Again, however, there is no 
statement to the effect that amount of stress 
toward consistency varies with the "complexity 
of imbalance." It is possible, of course, that 
total pressure toward cognitive modification 
does not vary in any simple way with degree 
of inconsistency. This may be the case particu- 
larly when only the structural relations and 
not the affective values of the contents are con- 
sidered. Both Festinger and Osgood, who ex- 
press the magnitude principle explicitly, deal 
with the properties of the cognitive elements 
themselves, not just their structural relation to 
each other. 


4. The Dynamics of Cognitive Interaction Are 
Such That Modifications Under Stress Always 
Reduce Total Cognitive Inconsistency 


We have here a kind of “mental homeosta- 
sis"26—cognitive inconsistencies set up pressures 
toward their own elimination. But we must say 
at the outset that modifications do not neces- 
sarily occur at all. People simply may not think 
about the matter or, as Festinger has shown, 
they may avoid exposing themselves to disso- 
nance-arousing information. Furthermore, as 
Heider points out, some people seem to be able 
to “live with it” or even actively seek cogni- 


26 R. Stagner, “Homeostasis as a Unifying Concept 
in Personality Theory,” Psychological Review, Vol. 
58, 1951, pp. 5-18. 
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tively disturbing situations; they are more “tol- 
erant of ambiguity,” to borrow an older phrase. 
Such tolerance probably increases with intelli- 
gence and education, and it certainly decreases 
under states of heightened ‘emotion. But even 
here there may be a larger consistency operat- 
ing: it may be intrinsically disturbing for some 
people to follow the simple-minded dictates of 
psycho-logic, at least to the extent they are 
aware of them. 

But let us assume that our subjects are ex- 
posed to inconsistency, that they are thinking 
about it, and that they are susceptible to psycho- 
logic—what are the alternative resolutions avail- 
able and how can we decide which will occur? 
We will find that theories in this field have 
been better at stating alternatives than at de- 
ciding among them. 

(a) The sign, or even existence, of a rela- 
tionship may be changed. For theories like 
Heider's that deal only with relations and not 
with the values of the included elements, this is 
the only type of resolution. However, since he 
deals with several types of relations, e.g., L vs. 
DL and U vs. not U, the situation is compli- 
cated. In the simplest diadic case, if P L O but 
also P not U O, either P may decide he doesn't 
like O or he may try to join O in some unit— 
what happens presumably depends upon the 
relative strengths of L vs. not U. In the triadic 
case of imbalance, where P L O, P L X, but O 
DL X, for example, as well as in the more 
complex cases treated by Cartwright and Ha- 
rary, any one of the relations may be changed 
to produce an even number of negatives, or 
none—but again, which one? Whether P will 
change toward disliking O or disliking X, or O 
change toward liking X, seems to require more 
information than the model itself provides. 
When new relations are brought into existence 
(Heider uses the term "induced"), they will be 
in balance with the existing structure, e.g. if 
P L O, the relation O L P tends to be inferred. 
Conversely, relations may be wiped out of exis- 
tence in the interest of balance. 

(b) The sign, or even existence, of a cogni- 
tive element may be changed. Although Fest- 
inger does not attribute evaluations per se to 
cognitive elements, he does deal with changes 
of these elements in the direction of conso- 
nance. When dissonance between behavior and 
cognition exists, a person may (1) change his 
behavior so as to be consonant with his cogni- 
tion, (2) change his cognition so as to be con- 
sonant with his behavior, ($) add new support- 


ing elements to reduce the total dissonance, or 
(4) eliminate old dissonant elements to reduce 
the total dissonance. In the case of our heavy 
smoker confronted with information about lung 
cancer, for example, he may (1) take up filter 
cigarettes, (2) question the validity of the infor- 
mation, (3) seek information showing how much 
more likely one is to die in automobile acci- 
dents, or (4) avoid reading or thinking about it 
any more—or, of course, just live with it as 
many of us do! By making certain assumptions 
about the importance of different elements and 
other factors in particular experimental situa- 
tions, Festinger is able to select among alter- 
native resolutions. 

Osgood and Tannenbaum have formulated a 
general law governing interactions among cog- 
nitive events: Interacting elements are modified 
in inverse proportion to their intensity or po- 
larization, i.e., the congruity formula, 


$ [bal 
Ibil + [bo] 


maal P 


" [bs] + [ts] 


in which mc refers to meaning change on some 
dimension (e.g. attitude), p refers to polariza- 
tion (deviation from neutrality regardless of 
sign), P refers to total pressure toward. congru- 
ity (cf. section 3 above), and the subscripts ! 
and 2 refer to the interacting cognitive elements. 
Given the assertion, "Eisenhower was greeted 
with flowers in Uruguay," and known values of 
+3 for Eisenhower and --1 for Uruguay, we can 
predict that three-fourths of the attitude change 
should be exerted on Uruguay and only one- 
fourth on Eisenhower. Similarly, in Heider's 
triadic P L O, P L X, O DL X, imbalance situ- 
ation, if the attitude toward O were more 
intensely favorable than that toward X the for- 
mula would have to predict that most of the 
pressure in this dissociative assertion would be 
on making X much less favorable. The Ei- 
senhower/Uruguay example illustrates another 
characteristic of the congruity hypothesis—it 
predicts changes on a quantitative basis even 
where relations are qualitatively balanced (here 
a + associated with a +). 

However, some insufficiencies of the congruity 
hypothesis of Osgood and Tannenbaum appear 
when it is applied experimentally. First, al- 
though the associative or dissociative nature of 
assertions (relations) determines the direction 
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of congruence (and hence the sign of P) in this 
model, the formula deals entirely with the 
changes in the meanings of the cognitive ele- 
ments. Yet change in sign or even denial of the 
assertion itself is obviously one type of resolu- 
tion. The assertion that Eisenhower is a card- 
carrying Communist produces incredulity and 
is denied. In order to generate accurate pre- 
dictions in his attitude-change experiment, 
Tannenbaum had to apply a "correction for 
incredulity" that increased with the magnitude 
of P between values of 3 and 6, ie, highly 
incongruous assertions. Second, the formula 
gives no weight to the direction of an assertion 
(cf. Heider's distinction between agent and re- 
cipient)—P praises O has the same effects upon 
the elements as O praises P, as far as the for- 
mula is concerned. Yet Tannenbaum's evidence, 
where Sources come out for or against Concepts, 
suggests that a greater share of the impact is 
exerted upon the recipient than the agent. 
The inverse proportionality formula underlying 
the congruity hypothesis may be valid in the 
“other things equal” sense, but there are cer- 
tainly other factors to be considered. 

In a recent paper titled “Modes of Resolution 
of Belief Dilemmas,” Abelson presents a con- 
siderable variety of resolution types.?? He limits 
his consideration to the simplest diadic situa- 
tions, in which A is either associated with or 
dissociated from B by assertions, but follows 
Osgood et al. in attributing positive or negative 
values to the cognitive elements A and B, them- 
selves. His first type—which he unfortunately 
calls denial—includes both the major forms of 
resolution discussed above, (1) where the sign 
of the relation is changed and (2) where the 
sign of one or both of the cognitive elements 
is changed. The term "denial" implies an all- 
or-nothing quality and a consciousness of deci- 
sion which certainly do not apply to many of 
the interactions we would like to handle, €g. 
in perceptual modification or in semantic adjust- 
ments to word mixtures. But let us now look 
at some of these other resolution types. 

(c) Other cognitive elements that are in bal- 
anced relation with one or the other of the dis- 
sonant. elements may be adduced (bolstering). 
As Abelson points out, this does not eliminate 
the imbalance but tends to “drown it out." This 
is a type of resolution stressed by Festinger, €g., 
the heavy smoker who says that one is more 


27 R. P. Abelson, “Modes of Resolution of Belief 
Dilemmas,” unpublished paper, 1959. 


likely to die in an automobile accident, that he 
has a large chest expansion, that he wouldn't 
be among the unlucky 15 per cent anyway, and 
so forth. Abelson also points out that it is usu- 
ally the more strongly entrenched (attitudinally 
polarized?) element that receives bolstering: if 
one is a devout Catholic, yet intellectually agrees 
with the use of contraceptives, he is likely to 
react to the Church's condemnation of this prac- 
tice by thinking of the Church's long history of 
being right, of God relying on him to exercise 
will power, and so on. 

(d) Other cognitive elements that are in a 
relation of imbalance with one or the other of 
the dissonant elements may be adduced (under- 
mining). Abelson does not distinguish this type 
of resolution from “bolstering,” perhaps because 
it is the converse. Here new cognitive ele- 
ments that have a dissociative (imbalance) rela- 
tion to the less firmly entrenched element are 
adduced—the familiar process of rationalization. 
Our devout Catholic above is likely to decide 
that contraceptives are physically injurious, that 
their use weakens a person’s will power, and so 
forth. Of course it must be recognized that 
adducing new relations, whether bolstering or 
undermining, must also produce shifts in the 
meanings of the elements involved 4 la con- 
gruity—which is presumably why Abelson says 
that, when failure of denial is followed by bol- 
stering techniques, this in turn is usually fol- 
lowed by a new phase of denial. 

In most real-life situations we are dealing 
with complex clusters of cognitive elements, 
each with its valence and each connected with 
others by an interlocking set of assertions, ex- 
plicit or implicit. Patterns of bolstering and 
undermining relations already exist, without 
need for adducing them. We may, following 
Abelson's analysis, suggest a general rule for 
resolution in complex cases: those cognitive 
modifications (changes in assertions or in ele- 
ment valences) will occur which require the 
minimum restructuring of the entire cognitive 
map. 

By way of illustration, let us look back at 
Figure 1B and verbalize a cognitive situation 
it might map: "I (P) am going around with an 
attractive young lady named O. She wants me 
to grow one of those filthy beards (X) that low- 
brow beatniks like M (whom I know and de- 
spise) and his friend N wear." The source of 
dissonance for this poor Ego can be shown by 
congruity analysis to lie in the assertion +O 
favors —X (beards). Now, if Ego were to grow 
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a beard (+P has —X), he would not only have 
to change his attitude toward beards (so that 
+P has + X), but he would also have to change 
his attitude toward beatniks M and N—or per- 
haps induce them to shave off their beards! The 
simplest solution, of course, involving changing 
the sign of only one assertion, is to get his lady 
friend to give up the beard-growing project 
(thus produce --O against —X), and we could 
confidently expect P to communicate vigorously 
(cf. Newcomb) with O along these lines. A sec- 
ond-best solution would be for this very proper 
Ego to change his evaluation of the girl, which 
in turn would require him to stop seeing her 
(+P not U —O) if he has any sense of consis- 
tency at all. This analysis is obviously insuffi- 
cient because we have not taken into account 
the relative evaluation weights of the elements; 
selection among these alternatives will also de- 
pend upon how much he likes O vs, how much 
he is against X—including all their supports in 
his total cognitive map. 

(e) One or the other of the dissonant cogni- 
tive elements may be split into two parts, these 
parts being of opposed valence and dissocia- 
tively related (differentiation). This would be 
another possible resolution for our disturbed 
Ego above—he might decide there are “distin- 
guished beards” and “pretentious beards” and 
then proceed to grow himself a “distinguished” 
one. Everything would then be in good cogni- 
tive order, with P, O, and X as one positive 
set opposed to M, N, and X’ as a negative set. 
It may be interesting to note that what is 
involved in differentiation is a denotative re- 
classification of cognitive units forced by a con- 
notative (affective) stress. Behaving the same 
way, having the same attitude, toward all mem- 
bers of a class (racial group, religious group, 
teachers, students, fried foods, Sport-car owners, 
cats, etc.) often runs us into cognitive conflicts 
of an affective, psycho-logical sort, and intellec- 
tual redifferentiation provides one way out. 

(f) Dissonant cognitive elements may be 
combined into a larger unit which, as a whole, 
is in balance with other cognitive elements 
(transcendence). Abelson gives as an example of 
this the typical resolution of the dissonance 
between “rational man” (A) and “spiritual 
man” (B): joined together in a larger unit, “the 
whole man,” they become congruent with posi- 
tive concepts (C) like "the full life," "a bal- 
anced education," and so forth. Or another 
example: Cain and Abel are always squabbling 
with each other; their father manages to avoid 


choosing between them by saying, "You are 
both my sons, and I love my sons." In itself, 
transcendence does not resolve incongruities— 
it merely hides them. However, as Heider has 
pointed out, simply being included within a 
common unit implies a consonant relation 
among the dissonant elements, and this must 
weaken the total dissonance—the father is im- 
pelled toward believing that Cain and Abel 
really love one another, and our educator is 
impelled toward believing that religion can be 
rationalized. 


SOME EVIDENCE ON COGNITIVE 
INTERACTION 


Here some indication will be given of the 
diversity of phenomena which appear to follow 
the rules of cognitive interaction. These rules 
may be summarized as follows: When, in the 
course of human thinking, inconsistent. cogni- 
tive elements are forced together by linguistic 
or behavioral assertions, stress is produced in 
proportion to the magnitude of the inconsis- 
tency, this stress producing cognitive modifica- 
tions which—by changing the nature of the 
assertion, changing the connotative meanings of 
the elements, differentiating or integrating the 
denotations of the elements—serve to re-estab- 
lish cognitive consistency. No attempt will be 
made to cover the experimental literature ex- 
haustively; it has become quite extensive. 


l. Attitude Change 


Whenever a person reads a book or news 
paper, listens to the belief statements of others, 
or even ruminates within his own storehouse of 
concepts, he is exposing himself to assertions 
which are likely to be incongruent to some de- 
gree with his existing frame of reference.?8 In 
experiments we try to measure some part of his 
existing attitude structure, produce messages 
which are congruent or incongruent to some 
definable degree, predict what the effects on 
him should be, and then measure his attitude 
structure again to determine the correctness of 
these predictions. 

Tannenbaum?? devised three supposed news- 


28 For a discussion of the relation between the af- 
fective and cognitive components of attitudes, sce 
M. J. Rosenberg, Public Opinion Quarterly, Vol. 
24, 1960, pp. 319-340. 

?9 P, H. Tannenbaum, "Attitudes toward Source 
and Concept as Factors in Attitude Change through 
Communications," University of Illinois, 1953, un- 
published doctoral dissertation. 
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paper messages in which a source (the Chicago 
Tribune, Senator Taft, labor leaders) comes out 
either in favor of or against a concept (abstract 
art, accelerated college programs, legalized gam- 
bling); the source/concept combinations (in the 
order given) were known by pre-test to divide 
college students about equally into 3-by-3 pat- 
terns, +S+C, +S—C, —S—C, etc., in terms of 
initial attitudes. On the basis of these original 
attitude measures, predictions of the direction 
and relative magnitude of attitude change were 
made, for both sources and concepts, by apply- 
ing the congruity formula. The direction of 
attitude change was as predicted in every case, 
and the correlation between predicted and ob- 
tained magnitude of change was highly signifi- 
cant (.91). However, to obtain these results the 
data had to be corrected in a uniform way for 
incredulity (denial of the asserted relation when 
highly incongruent) and for directional effect 
(shifts in concept attitudes being regularly 
greater than shifts in source attitudes). 

Kerrick has tested the congruity hypothesis 
under more realistic conditions.39 The effects of 
an editorial (message) in a weekly newspaper 
(source) urging election of certain people to the 
school board (concept) were predicted from pre- 
exposure attitude scores in a sample of the read- 
ers of this weekly. The average effect of the 
editorial was nil, but when the sample was 
broken into two groups—those relatively favor- 
able toward this weekly and those relatively un- 
favorable—it was found that attitudes toward 
the concept went up for the group favorably 
disposed toward the source and went down for 
the other group. 


2. Cognition and Behavior 


Pressures toward consistency in what one be- 
lieves and how one behaves have been studied 
most intensively by Festinger and his associates 
in a variety of ingeniously contrived, realistic 
experiments.?! Two examples will have to suf- 
fice. 

(a) Consequences of decisions. The need for 
making a decision implies that there are disso- 
nant elements in either course; therefore, to a 
degree dependent on the importance of the 
decision, the relative attractiveness of the un- 
chosen course, and the degree of overlap of 


30 J, S. Kerrick, "The Weekly Newspaper as a 
Source: Prediction of an Editorial’s Effectiveness,” 
unpublished paper, 1959. 

31 Festinger, op. cit. 


elements in the two courses, decisions will leave 
some dissonance to be reduced. 

(b) Exposure to information. Dissonance 
theory also leads to the prediction that people 
will avoid exposure to dissonance-increasing in- 
formation and seek exposure to dissonance- 
decreasing information; if exposure to dissonant 
information is forced, they will defend against 
fully cognizing it.52 


3. Interpersonal Perception 

Although Heider himself has not contributed 
heavily on the experimental side, many other 
people have done research either inspired by or 
at least relevant to his theoretical notions. Jor- 
dan directly tested Heider's prediction that peo- 
ple prefer balanced to imbalanced situations.93 
Sixty-four hypothetical (and, unfortunately, 
very abstract) triadic situations were given to 
subjects to rate for degree of pleasantness, half 
of them being cognitively balanced and half 
unbalanced. An example would be: "I dislike 
O; I have a sort of bond or relationship with 
X; O likes X." The ratings for balanced triads 
were significantly more “pleasant” than ratings 
for unbalanced triads, and Cartwright and Ha- 
rary have shown that if the ambiguous not U 
situations are eliminated, the difference becomes 
even greater. Heider cites a great deal of exper- 
imental evidence by others,34 which cannot be 
reported in detail here, in support of various 
deductions about interpersonal relations that 
follow from cognitive balance theory: Egos tend 
to like Alters who are similar to themselves and 
dislike Alters who are dissimilar; Egos assume 
that Alters they like are more similar, and that 
Alters they dislike are less similar, to themselves 
than is the case; Egos tend to join in groups 
with Alters they like and to come to like Alters 
in groups they join; Egos tend to like what they 
own and to own what they like. 


4, Communication and Group Cohesiveness 


Newcomb cites experiments by Festinger and 
Thibaut,’ by Schachter,3¢ and by Back’? which 


82 For results of the experiments related to this 
proposition, see A. R. Cohen, “Attitudinal Conse- 
quences of Induced Discrepancies between Cogni- 
tions and Behavior,” Public Opinion Quarterly, 
Vol. 24, 1960, pp. 297-318. 

33 N. Jordan, “Behavioral Forces That Are a Func- 
tion of Attitudes and of Cognitive Organization,” 
Human Relations, Vol. 6, 1953, pp. 273-287. 

34 Heider, The Psychology of Interpersonal Rela- 
tions, Chap. 7. 

35 L. Festinger and J. Thibaut, “Interpersonal 
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demonstrate (a) that attempts to influence others 
increase with the attractiveness of the others to 
Ego, (b) that communications in cohesive groups 
tend to be directed toward those perceived as 
most deviant (up to the point where the sign 
of the relation shifts and a deviate is ejected 
from the group), and (c) that communications 
within groups typically result in both increased 
uniformity (co-orientations toward relevant X's) 
and increased cohesiveness (co-attractions among 
members). There are many other experiments 
in which the role of interpersonal communica- 
tion can legitimately be inferred even though 
it was not directly observed, e.g., in a study by 
Festinger, Schachter, and Back? where a corre- 
lation of +.72 was found between a measure 
of interpersonal attractiveness and a measure of 
attitudinal conformity among people in a hous- 
ing project. 


5. Semantic Interactions 


Finally, we will look briefly at a series of 
experiments which—precisely because of their 
remoteness from familiar everyday matters like 
attitudes, beliefs, and interpersonal relations— 
may serve to indicate the generality of our prin- 
ciples. All these studies are alike in that mean- 
ing as measured with a form of semantic 
differential is the dependent variable. They are 
also alike in general design: subjects first judge 
the meanings of a set of component stimuli and 
then judge the meanings of various combina- 
tions of these stimuli; predicted meanings of 
the combinations, derived from applying the 
congruity formula to the component meanings, 
are checked against the actual meanings of the 
combinations derived from subject ratings. 

(a) Word mixture.39 All subjects first rated 
the meanings of eight adjectives ("sincere," 
“breezy,” “listless,” etc) and eight nouns 
("nurse," "husband," “prostitute,” etc.). Then 


Communications in Small Groups,” Journal of Ab- 
normal and Social Psychology, Vol. 46, 1951, 
pp. 92-99, 

86 S, Schachter, “Deviation, Rejection, and Com- 
munication,” Journal of Abnormal and Social Psy- 
chology, Vol. 46, 1951, pp. 190-207. 

37 K. Back, “The Exertion of Influence through 
Social Communication,” Journal of Abnormal and 
Social Psychology, Vol. 46, 1951, pp. 9-23. 

38 L, Festinger, S. Schachter, and K. Back, Social 
ase in Informal Groups, New York, Harper, 
1950. 


39 Osgood, Suci, and Tannenbaum, of. cit, pp. 
275-284. 


they rated eight of the sixty-four possible com- 
binations ("sincere husband," “breezy prosti- 
tute," “listless nurse," etc.), eight groups being 
required to complete the design. Problem: Can 
the meanings of the combinations be predicted 
from the measured meanings of the components? 
Correlations between predicted and obtained 
meanings were high, .86, .86, and .90 for the 
evaluative, potency, and activity factors respec- 
tively. A constant error appeared for the evalu- 
ative factor, however—obtained evaluations of 
the mixtures were consistently less favorable 
than predicted evaluations. No obvious expla- 
nation for this error is at hand. 

(b) Fusion of facial expressions.* The affec- 
tive meanings of five facial expressions posed 
by the same person (intended to convey glee, 
rage, optimistic determination, complacency, 
and passive adoration) were obtained from 
thirty subjects in terms of two dominant factors, 
pleasantness and activation. 'Then the same sub- 
jects viewed all possible combinations of these 
expressions when fused in a stereoscope under 
conditions designed to minimize eye dominance. 
The apparent meanings of these fused expres- 
sions proved to be predictable via the congruity 
formula to a high degree. The median intra- 
subject correlation (across the ten combinations) 
between predicted and obtained scores was .84 
for pleasantness and .82 for activation. Further- 
more, as must be predicted from the congruity 
hypothesis, the semantic profiles for combina- 
tions regularly correlated more highly with the 
more polarized component; that is, the more 
intensely meaningful expression dominated in 
the process of interaction. 

(c) News photos and captions. Kerrick!! de- 
vised two unrelated captions ("A Quiet Minute 
Alone" and "Exiled Communist") to go with 
each of five pictures (e.g., a full profile shot of 
a well-dressed man on a park bench). Subjects 
judged the captions and pictures alone and 
then in combination (the design was such that 
subjects judging one set of captions alone would 
get the other set linked with the pictures). The 
results were strikingly different from those of 
(a) and (b) above: if picture and caption in iso- 
lation had opposed meaning, the meaning of 
their combination was predictable via congru- 
ity; but if picture and caption in isolation had 


40 A. Hastorf and C. E. Osgood, unpublished re- 
search. 

41 J. S. Kerrick, "News Pictures, Captions and the 
Point of Resolution," Journalism Quarterly, Vol. 
36, 1959, pp. 183-188. 
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similar meanings, their effects summated in 
combination, contrary to the congruity princi- 
ple. In other words combining a slightly 
"happy" picture with a slightly "happy" caption 
produced a very "happy" whole. 

(d) Sound movies. A similar summation ef- 
fect was found by Gregory in an unpublished 
study on the combinations of words with sight 
in sound movies? Five short “takes” of an 
actor saying five phonetically similar things 
with different meanings, with appropriate facial 
expressions and gestures, were recorded on 
sound film (e.g, “I can't get over the death of 
my wife," or, “This is the happiest day of my 
life"). Subjects judged the heard words alone, 
the viewed movies alone, the original combina- 
tions, and crossed combinations produced by 
splicing the words of one "take" with the sight 
of another. The results of the crossed combina- 
tions were generally predictable from the com- 
ponents via congruity, but the meanings of the 
"natural" combinations were more polarized 
than either component alone. We can only 
guess at why meaningful summation occurs in 
experiments (c) and (d) but not in experiments 
(a) and (b); both situations (c) and (d) involve 
interactions across modalities, and it may be 
that in such situations something other than 
simple congruity is operating. 


COGNITIVE DYNAMICS IN PUBLIC AFFAIRS 


In the absence of a science of public affairs, 
national and international, we can at least hy- 
pothesize that laws governing the thinking and 
behaving of individuals also govern the "think- 
ing" and "behaving" of groups. The leap from 
individual cognitive maps to the structuring of 
relations within small groups has already been 
made—in some cases apparently without any 
self-consciousness of the shift in reference. How-, 
ever, application of the laws of cognitive dy- 
namics to public affairs can be justified on 
several grounds: first, with nothing but com- 
munication to bind us together, it is clear that 
“decisions” and “behaviors” of nations must 
come down to myriad decisions in individual 
nervous systems and myriad behaviors of indi- 
vidual human organisms. Second, to the extent 
that government is popular, we can work on the 
basis of averaging over individuals, and to the 
extent that government is not, we are back to 
individuals anyhow. Finally, evidence abounds 


42 J. Gregory, unpublished research. 


that we do “personalize” groups and issues—not 
so much by an error of oversimplification as by 
an intuitive grasp of the underlying laws. 

The analogue of a cognitive element for an 
individual is what we may call a cultural mean- 
ing (stereotype, public image, etc.) for a group. 
Although individuals within groups may be ex- 
pected to vary in their private meanings, it is 
characteristic of cohesive groups, as Newcomb 
has shown, for interpersonal communication to 
produce increased uniformity of opinion and 
attitude. Mass communications have this func- 
tion for the larger groupings of individuals in 
modern society, such as nations. Many of the 
applications of the semantic differential—in the 
study of information about mental health and 
illness, of images of political personalities and 
issues, of commercial institutions and products, 
and so forth—have dealt with cultural mean- 
ings based on reasonably representative groups 
of people. The degree of conformity on issues 
is often striking, 90 to 100 per cent of subjects 
frequently choosing the same side, if not the 
same intensity. This happens both for common 
meanings (tornadoes are active) and for attitude 
objects (the Bible is good). 

Now, to the extent that the cultural mean- 
ings of two socially significant referents have 
different evaluative locations, increasing propor- 
tions of individuals will necessarily experience 
pressures toward congruity when these items are 
forced into interaction by assertions in the mass 
media. If, under accusation by his rival or per- 
haps a mantle of honesty, a candidate for public 
office admits that he, John Jones, does not 
believe in God, we have for large numbers of 
people a somewhat unfamiliar and neutral 
source making a negative assertion about a 
deeply entrenched and favorably polarized con- 
cept. The result is inevitable—all the pressure 
toward consistency is on John Jones to move 
in a negative direction. Whenever political can- 
didates, particularly unfamiliar ones, run into 
conflict with the mores—become associated with 
bad things, like gambling, call girls, and di- 
vorce, or become dissociated from good things, 
like Mother, God, and Country—they lose out. 
Conversely, political candidates themselves try 
to establish associative assertions with good 
things and dissociative assertions with bad, by 
their statements and by their behavior. Even if 
the private attitudes and beliefs of the elites in 
popular governments may be at variance with 
such a simple-minded organization of the uni- 
verse, they rarely run counter to the tide. These 
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are all very familiar facts about public affairs, 
but I think they illustrate the underlying laws. 

Among our store of cultural meanings are a 
large number of “personalized” national stereo- 
types. Whether it is valid or not, populations in 
various countries do react to nations as if they 
were a collection of people, having certain 
“personality” traits and being organized into 
unstable group structures. The findings of Bu- 
chanan and Cantril in their book, How Nations 
See Each Other,*8 are quite consistent with this 
idea. Following Heider, if P likes O (e.g., O is 
an ally of P) and P dislikes X (eg, X is an 
enemy of P), then P should attribute favorable 
characteristics to O and unfavorable ones to X. 
We find that Americans most often attribute to 
the British the traits of being intelligent, hard- 
working, brave, and peace-loving, whereas they 
most often attribute to the Russians the traits 
of being cruel, backward, hard-working, and 
domineering (being "hard-working" is obviously 
independent of general evaluation). We would 
also expect people to view their own country- 
men favorably, in the interest of cognitive bal- 
ance; the four most frequent selfattributions 
(across the eight countries sampled) were peace- 
loving, brave, intelligent, and hard-working. 
Although Buchanan and Cantril did not get 
into this matter, Heider's notions of cognitive 
balance would lead us to expect that if P likes 
O and P likes Q, but P dislikes X, then P would 
infer that O likes Q and that O and Q dislike 
X, and vice versa. Americans expect the British 
to favor the French, and vice versa, but they 
expect both to be as antagonistic to the Rus- 
sians as we are. If P likes O, he also must infer 
that O likes P reciprocally; but as studies like 
The Ugly American** show only too clearly, 
this doesn't necessarily follow. 


The operation of psycho-logic in national 


43 W. Buchanan and H. Cantril, How Nations See 
Each Other, Urbana, Ill, University of Illinois 
Press, 1953. 

44 W. J. Lederer and E. Burdick, The Ugly Amer- 
ican, New York, Norton, 1958. ; 


and international affairs shows up quite gener- 
ally once you start to look for it. All nations in 
time of conflict, for example, create “bogey 
men" on this basis. If we are good, kind, and 
fair and they are our enemy, then psycho-logic 
dictates that they must be bad, cruel, and un- 
fair. However, when we are exposed to live Rus- 
sians, as tourists in their country or as hosts to 
them in our homes and farms, and we find them 
in many ways just like us, cognitive disturbance 
is produced; it may be eliminated by the tech- 
nique of differentiation—it is the Russian lead- 
ers who are bad, not the Russian people. And 
we confidently wait for the good Russian people 
to overthrow the bad Russian leaders—just as 
the Russians, no doubt, confidently wait for 
the good American workers to begin the revo- 
lution against their bad capitalist leaders. Abel- 
son illustrates how differentiation works with a 
different example, the conflict over hydrogen 
bomb testing (+) being associated with poison- 
ous fallout (—): if one can accept a distinction 
between using "clean bombs" and "dirty 
bombs," then a cognitively consistent resolution 
is achieved.45 

Such examples could be adduced ad infini- 
tum, but they would not further prove the 
thesis. The essential argument of this paper has 
been as follows: (1) there has been a consider- 
able confluence (if not unanimity) in theories 
of cognitive interaction, sufficient at least to 
make it possible here to formulate several gen- 
eral principles; (2) there is a great deal of evi- 
dence for the operation of such principles in 
the thinking and behaving of human individ- 
uals, in areas as diverse as interpersonal rela- 
tions and the semantics of word combination; 
and (3) we can at least hypothesize the opera- 
tion of the same laws at the level of groups, 
national and international, and find illustra- 
tions which seem to support the hypothesis. 
How useful such a model would be for pre- 
dicting in the area of public affairs remains to 
be seen, 


45 Abelson, op. cit. 


47. Prediction of Attitude from the Number, Strength, and 
Evaluative Aspect of Beliefs about the Attitude Object: 
A Comparison of Summation and Congruity Theories 


LYNN R. ANDERSON and MARTIN FISHBEIN 


In a recent series of articles, Fishbein (1961, 
1963, 1965) has presented a theory of attitude 
organization and change based on the relation- 
ships between beliefs about an object and the 
attitude toward that object. Consistent with 
Osgood, Suci, and Tannenbaum (1957), attitude 
is defined as the evaluative dimension of a con- 
cept, where the term “concept” refers to any 
discriminable aspect of an individual’s world, 
verbalizable or not. In Osgood’s behavior theory 
terminology, an attitude can be described, as 
a mediating evaluative response associated with 
any stimulus (Osgood et al., 1957). Operation- 
ally, an individual's attitude toward any concept 
can be assessed by having the individual rate 
the concept on a series of bipolar evaluative 
scales (e.g., good-bad, clean-dirty) of the seman- 
tic differential form. 

Similar to the definition of attitude, belief is 
defined as the probability dimension of a con- 
cept. Belief in a concept is seen as the proba- 
bility that the concept per se (e.g, ESP) does 
exist, while belief about a concept is defined as 
the probability that a specific relation exists 
between the concept and some other object, 
concept, value, or goal (e.g, ESP is mystical, 
ESP is useful). Beliefs can be measured by hav- 
ing the subject rate the assertion (belief about) 
or the concept per se (belief in) on a series 
of bipolar probabilistic scales (e.g, probable- 
improbable, likely-unlikely) of the semantic 


* Reprinted from Journal of Personality and Social 
Psychology, 1965, 3, 437-448, with permission of the 
authors and the American Psychological Associa- 
tion. This study was supported in part by a grant 
from the University of Illinois Research Board. The 
authors would like to express their special appre- 
ciation to C. E. Osgood for suggesting the test of 
his new extension of congruity theory. 


differential form. This latter instrument has 
been called the B scale by Fishbein and Raven 
(1962), and considerable support for its relia- 
bility and validity has been presented. 

Essentially, Fishbein's theory leads to the pre- 
diction that an individual's attitude toward any 
object is a function of his beliefs about the 
object and the evaluative aspects of those be- 
liefs. 'The prediction can be expressed by the 
following formula: 


N 
4, = > Ba; 


i—1 
where 


A, = the attitude toward object “o” 

B, = the strength of belief i about o (i.e., the 
probability that o is related to some 
other object "x," 

a,= the evaluative aspect of B, (ie, the 
evaluation of x,) 

N — the number of beliefs 


As can be seen from the formula, the attitude 
toward the object is predicted to be a part 
function of the total amount of affect associated 
with each of the beliefs about the object. 
Fishbein (1961) and his associates (Fishbein and 
Hunter, 1964; Triandis and Fishbein, 1963) 
have previously pointed out that this predic- 
tion is opposed to the predictions which would 
follow from most of the theories of attitude 
change based on a notion of "consistency." In 
general, the consistency theories state that an 
individual's attitude toward an object is essen- 
tially based on a mean, or weighted mean, of 
the affect associated with each of the beliefs 
about the object. For example, both the prin- 
ciple of congruity (Osgood et al., 1957; Osgood 
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and Tannenbaum, 1955) and Heider's (1958) 
balance theory would predict that if an indi- 
vidual held two beliefs about one object, one 
belief containing a high positive evaluative as- 
pect and the other a low positive evaluative 
aspect, the individual's attitude toward the ob- 
ject would essentially represent a convergence 
or mean of these two evaluative aspects! In 
contrast, Fishbein's theory would predict that 
the attitude is a function of the algebraic sum 
of the evaluative aspects of the beliefs. To 
summarize, a basic difference between Fish- 
bein's theory and the various consistency theo- 
ries is that Fishbein views attitude organization 
and change as a process of cognitive summation, 
while consistency theories view attitude organi- 
zation and change as a process of cognitive 
balance. 

Two recent studies provide support for the 
summation approach. Triandis and Fishbein 
(1963) compared predicted and obtained values 
of subjects attitudes toward complex stimuli. 
One set of predictions was: based on an exten- 
sion of congruity theory, while another set of 
predictions was based on summation theory. 
The results clearly indicated the superiority of 
the summation theory predictions. Fishbein 
and Hunter (1964) provided four groups of 40 
subjects with different amounts of positively 
evaluated information about a fictitious person. 
The information was presented in such a way 
that the total amount of affect associated with 
the information increased as a function of the 
amount of information, while the mean amount 
of affect associated with the information de- 
creased as a function of the amount of infor- 
mation presented. Consistent with summation 
theory, the amount of attitude change was a 
function of the total amount of affect associated 
with the information presented. In addition, 
indirect evidence suggesting cognitive summa- 
tion has also been obtained by Feldman (1962), 
Kerrick (1959), Podell and Podell (1963), and 
Tanaka (1964). 

The present investigation is a further attempt 
to test Fishbein's theory. While all of the above 
studies provide support for a summation the- 
ory, none are strict tests of Fishbein’s position. 
That is, in none of the above studies was the 
strength of belief (B,) utilized in the prediction 
of attitudes. In addition, with the exception 


1 For a more complete discussion of the distinction 
between summation and balance theory sec Fish- 
bein (1961), Triandis and Fishbein (1963), and Fish- 
bein and Hunter (1964). 


of the Triandis and Fishbein study, none of 
the above studies has attempted to compare the 
explicit quantitative predictions of two dif- 
ferent theories. With respect to this latter study, 
however, Osgood (1963) has recently presented 
an extension of congruity theory that differs 
from the earlier extension utilized by Triandis 
and Fishbein. This extension was in part pre- 
sented in response to studies "which suggest 
an additive rather than a polarity-weighted 
function.” As Osgood points out, however, 
"Whether differences would be obtained in 
predictions under these two conditions . . . [the 
new as opposed to the "old" extension] . . . 
remains to be seen.” : 
Thus, the purpose of the present paper is 
threefold: to provide a further test of the hy- 
pothesis that attitude change is a function of 
the total affect associated with each of a per- 
son's beliefs about the attitude object; to pro- 
vide an explicit comparison between Fishbein's 
summation theory and Osgood's congruity the- 
ory; and to compare Osgood's new extension 
of congruity theory with the extension previ- 
ously utilized by Triandis and Fishbein. 


METHOD 


Subjects 


One hundred undergraduate subjects (50 
males and 50 females) were randomly assigned 
to five groups. The only restriction placed on 
the random assignments was that there would 
be an equal number of males and females in 
each group. 


Procedure 


The experiment was administered under the 
guise of a Literary Preference test in which the 
subjects were to evaluate each of four mimeo- 
graphed paragraphs? taken from a short story. 
The attitude object was an incidental charac- 
ter, Mrs. Williams, who was mentioned in each 
of the four paragraphs. In order to provide a 
pretest measure of evaluation, the subjects, be- 
fore reading the paragraphs, rated the name 
Mrs. Williams and other characters in the story 


2 A copy of the paragraphs used has been deposited 
with the American Documentation Institute. Order 
Document No. 8463 from ADI Auxiliary Publica- 
tions Project, Photoduplication Service, Library of 
Congress, Washington, D. C. 20540. Remit in ad- 
vance $1.25 for microfilm or $1.25 for photocopies 
and make checks payable to: Chief, Photoduplica- 
tion Service, Library of Congress. 
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on the A Scale of Fishbein and Raven (1962). 
The A Scale consists of five empirically deter- 
mined, evaluative scales of the semantic differ- 
ential form. Since each polar pair of adjectives 
had a 7-point scale, a rated concept could ob- 
tain a score ranging from —15 (negative evalu- 
ation) to +15 (positive evaluation). The five 
scales used were: harmful-beneficial, foolish- 
wise, dirty-clean, bad-good, and sick-healthy. 

After the pretest all subjects were then in- 
structed to read and to evaluate each of the 
four paragraphs by circling a letter grade be- 
neath each paragraph. The paragraphs were 
identical in each of the five experimental 
groups except for the inclusion of the evalua- 
tive adjectives used to describe Mrs. Williams. 

Group I. This group represents a control in 
that no adjectives or evaluative characteristics 
were used to describe Mrs. Williams in any of 
the four paragraphs. 

Group II. For the subjects in this group, the 
adjective "honest" was inserted before the noun 
"Mrs. Williams" in the first paragraph. The 
three remaining paragraphs were identical to 
those of Group I. 

Group III. The adjective “honest” again de- 
scribed Mrs. Williams in the first paragraph, 
and the adjective "friendly" was attributed to 
her in the second paragraph. The remaining 
two paragraphs were identical to the control 
paragraphs. 

Group IV. The adjective "honest" was in- 
serted in the first paragraph, "friendly" was 
inserted in the second paragraph, and "help- 
ful" was used to describe Mrs. Williams in the 
third paragraph. The fourth paragraph was 
again identical to the control group. 

Group V. This group was similar to Group 
IV in having Mrs. Williams described as 
"honest," "friendly," and "helpful" in the first 
three paragraphs, respectively. In the fourth 
paragraph, however, the adjective "helpful" 
was repeated. Thus, for the subjects in this 
group, Mrs. Williams was described as being 
helpful in both the third and fourth paragraphs 
of the short story. Group V was included in 
the experiment in order to assess the effects of 
a single repetition of a belief statement on 
attitude change. 

After the subjects had read and evaluated 
each of the four paragraphs, they were told 
that the experimenter was also interested in 
determining the type of impression which the 
paragraphs had created regarding each of the 
characters in the story. The subjects were asked 


again to describe the fictional characters, in- 
cluding Mrs. Williams, on the A Scale. This 
measure provided the posttest attitude score. 

In order to determine if the subjects had 
actually learned the experimental characteris- 
tics attributed to Mrs. Williams, three assertive 
statements (“Mrs. Williams is honest,” “Mrs. 
Williams is friendly," and "Mrs. Williams is 
helpful") were each rated by all subjects on 
Fishbein and Raven's (1962) B Scale. This scale 
is also of the semantic differential form and 
contains five, empirically determined, proba- 
bilistic scales (impossible-possible, false-true, 
nonexistent-existent, improbable-probable, un- 
likely-likely). The scale provides a measure of 
each subject's beliefs about Mrs. Williams, and 
thus provides the belief score (B,) utilized in 
Fishbein's prediction formula. The subjects 
were finally asked to evaluate each of the de- 
scriptive adjectives (honest, friendly, and help- 
ful) on the A Scale. These ratings provide a 
measure of the evaluative aspect of each belief 
(a) used in Fishbein's prediction formula and 
a measure of the evaluation and polarity of 
each of the adjectives used in the congruity 
prediction formulas. 


RESULTS 


Obtained Evaluations of the Attitude Object 


The pretest measure indicated that the five 
experimental groups were comparable in initial 
evaluation of the attitude object (F — .414, df 
= 4/90). Thus, analyses were conducted only 
on the data obtained on the posttest A Scale. 
The main hypothesis predicted that the sub- 
ject’s evaluation of the attitude object, Mrs. 
Williams, would be a function of the total 
amount of affect associated with each of the 
characteristics attributed to the object. Since 
only positively evaluated adjectives were pre- 
sented to the experimental groups, the mean 
evaluative score should increase with the num- 
ber of adjectives presented if the hypothesis is 
to be supported. The mean posttest score for 
each of the five experimental groups is pre- 
sented in Table 1. When tested by an analysis 
of variance, these means proved to be signifi- 
cantly different (F = 6.31, df = 4/90, p < .01).3 


3 The significance test was made on the data ob- 
tained from all five experimental groups. Although 
four adjectives were presented in Group V, only 
three different adjectives were utilized. Hence, the 
data from this group may not represent a direct 
test of the hypothesis. However, when a similar 
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TABLE 1. Obtained Mean Evaluation Scores 


for the Experimental Groups 
Number of 
Experimental Adjectives 
Group Presented Obtained M 
I 0 .95 
II 1 3.15 
IH 2 2.85 
IV 3 5.90 
M 4* 7.10 


* Although four adjectives were used in de- 
scribing Mrs. Williams in Group V, only three 
different adjectives were presented with one 
adjective being presented twice. 


In addition, the slope of the regression of atti- 
tude scores on number of adjectives presented 
was tested and found to be significantly differ- 
ent from zero (F = 11.98, df = 1/90, p < .001). 
Thus, in support of a summation theory, atti- 
tude scores do increase significantly with the 
number of adjectives presented, and therefore, 
with the total amount of affect associated with 
each belief. 


Differential Predictions of Attitude towards the 
Attitude Object 


Three predicted attitude scores were com- 
puted for each subject: based on Fishbein’s 
summation theory, based on Triandis and 
Fishbein's extension of congruity theory, and 
based on Osgood’s more recent extension of 
congruity theory. 

Summation theory predictions. According to 
the prediction formula presented by Fishbein, 
an individual's attitude toward any object 
should be a function of the sum of the prod- 
ucts of beliefs about the object (B) and the 
evaluative aspect of these beliefs (2). Each sub- 
ject had indicated the strength of his beliefs 
that “Mrs, Williams is ‘honest,’ ‘friendly,’ and 
‘helpful,’ ” and had evaluated each of the three 
descriptive adjectives. Thus, by direct appli- 
cation of the formula (ie, the predicted 
attitude = X,” Ba) a predicted attitude score 
was computed for each subject. 

Congruity theory predictions: Triandis and 
Fishbein's extension. In a recent paper, Tri- 
andis and Fishbein (1968) presented a simple 
mathematical extension of congruity theory 


analysis of variance test was made using only the 
data from the first four groups, the F ratio was sig- 
nificant beyond the .05 level. 


that permitted the consideration of more than 
a single adjective-noun combination. The for- 
mula is identical to the one presented by 
Osgood et al. (1957), but it allows for the case 
where more than one adjective is associated 
with the noun. The formula is given as follows: 


52 lao | + [as|(a5) 


Predicted attitude — LL ee 
DI 
i=1 


|a,| = the absolute evaluation of polarity of 
adjective “i” 
(a;) = the algebraic evaluation of adjective i 
|a,| = the absolute evaluation or polarity of 
the noun 
(a,) = the algebraic evaluation of the noun 
N — number of adjectives associated with 


the noun 


where 


By direct application of the formula, predic- 
tions were made for each subject based on his 
pretest evaluation of the noun "Mrs. Williams" 
and his posttest evaluations of each of the 
descriptive adjectives. It should be noted, how- 
ever, that only those adjectives which were 
presented to the subject in the communication 
were utilized in computing his predicted atti- 
tude score. Although only three different ad- 
jectives were presented in Group V, the second 
presentation of the adjective "helpful" was 
treated in the formula as a separate adjective. 

Congruity theory predictions: Osgood's new 
extension. Osgood (1963) has recently stated 
that the formula proposed by Triandis and 
Fishbein may be inadequate in representing 
the actual cognitive "balancing" that takes 
place as the individual is given more informa- 
tion about an attitude object. The objection is 
based mainly on the notion that a phrase such 
as “he is a wise, unscrupulous man” should be 
viewed as two separate assertions: the man is 
wise, and the man is unscrupulous. 

“According to this view, the meaning of man 
resulting from the man is wise should first be 
determined and then ‘plugged’ into the second 
assertion, the man (as qualified) is unscrupu- 
lous (Osgood, 1963, p. 103)." Thus, the major 


* In the case where only one adjective was pre- 
sented the formula is identical to the one presented 
by Osgood et al. (1957). 
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objection to the formula proposed by Trandis 
and Fishbein is that it does not take into ac- 
count this step-by-step convergence of evalua- 
tion, but rather, assumes that convergence 
occurs in one cognitive operation. Although 
Osgood's extension was formulated specifically 
to handle the prediction of complex stimuli 
(e.g, the wise, unscrupulous man; honest, 
friendly Mrs. Williams), it can be seen that his 
objection is particularly apropos in the present 
experiment since the subjects actually received 
separate assertions about Mrs. Williams in the 
communication. The general formula for the 
new extension is given as follows: 


ai (2) + lanal Gn) 


Predicted 
a] + lanl 


attitude — 


where 

|aj| and (aj) = the absolute and alge- 
braic evaluation of the 
“ith” adjective 

| and (a,,_,) = the absolute and alge- 
braic evaluation of the 
noun modified by i— 1 
adjectives 


ls, 1 


For example, the predicted attitude for a group 
receiving two adjectives and a noun would be 
a function of the evaluation of the second ad- 
jective and the evaluation of the noun after it 
had been modified by the first adjective. Once 
again, the subjects’ evaluations of the adjectives 
and their pretest evaluation of Mrs. Williams 
were used to compute the predicted scores. 
Here too, only those adjectives actually pre- 
sented to the subjects were included in the 
computations. It should also be recalled that 
in Group V, which received the adjective help- 
ful twice, the adjective was considered as two 
separate adjectives (i.e, Mrs. Williams, modi- 
fied by honest, friendly, and helpful, is again 
modified by helpful for the final predicted 
score). 

The intercorrelations of the three predicted 
scores and the obtained scores may be seen in 
Table 2. All three of the predictors are signifi- 
cantly correlated with the obtained scores (p < 
.001). However, consistent with previous find- 
ings, the predictions based on Fishbein's sum- 
mation theory are significantly more accurate 
than the predictions based on either form of 
the congruity theory (Z = 2.74, p < .01 for both 
comparisons) These findings provide further 
support for Fishbein's theory and once again 
suggest that attitude organization and change 


TABLE 2. Intercorrelations of the Various 
Predicted Scores and. the Actual Posttest 
Evaluation of the Attitude Object 


(N — 100) 
Congruity Obtained 
Congruity Prediction Fishbein Posttest 

Prediction (Extended) Prediction Evaluation 

[0] (2) 9. (4) 

1 - 86 46 38 

2 — E .39 

3 — .66 

4 -— 


Note. All correlations are significant beyond the .001 level. 


is a function of cognitive summation rather 
than cognitive balance. Table 2 also indicates 
that Osgood's new extension of congruity is 
highly correlated with the extension previously 
utilized by Triandis and Fishbein (r = .86) and 
perhaps more importantly, indicates that the 
new extension adds no predictive power over 
the original formulation. 

Two basic objections could be raised con- 
cerning the intercorrelations presented in Ta- 
ble 2. First, with respect to the comparison 
between summation and congruity theory, it 
should be noted that in computing the sum- 
mation theory predictions all of an individual's 
beliefs are taken into account in the computa- 
tion, regardless of whether or not they were 
actually included in the communication. The 
congruity formulas, on the other hand, use only 
the evaluation of those adjectives actually in- 
cluded in the message presented to the subject. 
Thus, the obtained differences between sum- 
mation and congruity may be artifacts produced 
by the utilization of different amounts of in- 
formation in the prediction formulas. Second, 
with respect to the comparison of the two 
extensions of congruity theory, it is possible 
that much of the correlation is spurious. Both 
extensions lead to identical predictions when 
only one adjective is associated with the noun 
(Group II) or when no adjective is associated 
with the noun (Group I). Thus, 40 of the 100 
scores predicted by both formulas are identical. 

In order to check against both of those ob- 
jections, the correlations reported in Table 2 
were computed again for only those subjects in 
Group IV (ie. the group receiving all three 
adjectives in the communication). Thus, for 
these 20 subjects, all three predicted scores are 
obtained from the same amount of data, and 
hence maximal differences between the two 
extensions of congruity theory should be ob- 
tained. While subjects in Group V also received 
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all three adjectives, it should be recalled that 
in the computation of the congruity predictions 
the repeated adjective was treated as a fourth 
adjective and thus the same problem of utiliz- 
ing different amounts of information would be 
encountered if Group V had been used in the 
present analysis. 

The intercorrelations of the three predicted 
scores and the obtained scores for the 20 sub- 
jects in Group IV may be seen in Table 3. The 


TABLE 3. Correlation of the Various Predicted 
Scores and the Actual Posttest Evaluation 

of the Attitude Object for Group IV 

(N = 20) 


Congruity Obtained 
Congruity Prediction Fishbein Posttest 
Prediction (Extended) Prediction Evaluation 
(1) (2) (3) (4) 


1 — .69** .80 .98 
2 = 46" 34 
3 — .66** 
4 — 
*pb«.05 
**p«c.0l 


results of the analyses based on the smaller 
sample are virtually identical to the results 
presented in Table 2. Thus, the higher accu- 
racy of prediction obtained from the summa- 
tion theory formulation is obviously not. a 
statistical artifact of the number of adjectives 
induded in the formula. Similarly, although 
the correlation between the two extensions of 
congruity theory is slightly lower (r = .69), it is 
still highly significant (p < .01). Finally, it can 
once again be noted that Osgood's extension 
of congruity theory does not lead to an increase 
in predictability. To summarize briefly then, 
these results strongly support a summation 
theory, and in particular, Fishbein's theory of 
the relationships between beliefs and attitudes. 
In addition they seriously question the utility 
of Osgood's new extension of congruity theory, 
as well as the older extension suggested by 
Triandis and Fishbein. 

Effect of repetition on attitude change. Sub- 
jects in Group V were presented with the three 
different adjectives as well as a repetition of 
one of the adjectives (helpful). A comparison 
of this group's attitude toward Mrs. Williams 
with the mean attitude of Group IV, which 
received only the three adjectives, constitutes 
a test for the effect of repetition on attitude 
change. Subjects in Group V were significantly 
more favorable toward Mrs. Williams than 
were subjects in Group IV (7.10 versus 5.90, 


t=7.31, df = 38, p < .001).5 Thus it appears 
that attitudes can be changed by even one 
repetition of the “message.” According to Fish- 
bein’s theory, this change should result pri- 
marily from an increase in the strength of 
belief that "Mrs. Williams is helpful." If pre- 
senting the adjective twice in Group V did 
serve to strengthen the subjects' beliefs, then 
the statement “Mrs. Williams is helpful" should 
have been rated higher (i.e. more probable) 
in Group V than in Group IV. Consistent with 
this prediction, the obtained difference between 
the means (7.45 versus 3.75) was significant 
(t — 1.79, df = 88, p < .05 using a directional 
test of significance). 

Indeed, one would expect that the main locus 
of effect of a communication would be to 
change and/or increase subjects' beliefs about 
the attitude object. That is, subjects whose 
communications mentioned "honest Mrs. Wil- 
liams," should believe that "Mrs. Williams is 
honest" more strongly than subjects who did 
not receive this information. Similarly, subjects 
receiving information about "friendly Mrs. 
Williams" should believe that "Mrs. Williams 
is friendly" more strongly than subjects not 
receiving this information. While the overall 
effects suggest that this is true (the mean belief 
scores on those statements contained in the 
communications — 6.26, while the mean belief 
scores on those statements not contained in the 
communications = 5.19; t, corrected for corre- 
lated scores, = 2.09, df = 298, p < .05), analy- 
ses of the individual belief statements suggest 
a more complex relationship between infor- 
mation contained in the communication and 
belief change. For example, although subjects 
in Group II only had the adjective honest 
paired with Mrs. Williams, this information 
produced a significant increase in their belief 
that “Mrs. Williams is friendly,” as well as the 
expected increase in the belief that “Mrs. 
Williams is honest.” Although these findings 
suggest some generalization, the results are not 
indicative of any simple or systematic effect. 
A fuller discussion and interpretation of these 
data must await additional research. 


SUMMARY AND CONCLUSIONS 


This paper presented a comparison between 
two competing theories of attitude organization 
and change. On the basis of Fishbein's (1961) 


5 Unless otherwise noted, all t tests are two-tailed. 
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summation theory, it was predicted that an indi- 
vidual's attitude toward any object would be a 
part function of the total amount of affect asso- 
ciated with each of the individual's beliefs 
about the object. In contrast to this, Osgood's 
congruity theory, and indeed, most theories 
based on a principle of consistency or balance, 
predicts that attitude is a part function of the 
mean amount of affect associated with an indi- 
vidual's beliefs. The results strongly supported 
the summation theory hypothesis. 

In addition, an explicit comparison of sum- 
mation and congruity theory was made. Predic- 
tions based on Fishbein's summation theory 
correlated significantly higher with obtained at- 


ity theory. This result held true for Osgood's 
more recent extension of congruity theory, as 
well as an earlier extension proposed by Trian-- 
dis and Fishbein. Indeed, predictions derived 
from Osgood's extension were no more highly 
correlated with the obtained attitudes than 
were the predictions based on the earlier ex- 
tension. Further, the two sets of congruity pre- 
dictions were highly correlated with each other. 
Thus, these results add further support to the 
hypothesis that attitude organization and 
change is a process of cognitive summation 
and not a process of cognitive balance, and 
point once again to the necessity of reexamin- 
ing the validity of the consistency principle in 


titude scores than predictions based on congru- attitude research. 
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48. An Experimental Analysis of Self-Persuasion 


DARYL J. BEM 


Self-awareness, one’s ability to respond differ- 
entially to his own behavior and its controlling 
variables, is a product of social interaction 
(Mead, 1934; Ryle, 1949; Skinner, 1953; Skin- 
ner, 1957). Among the responses that comprise 
self-awareness, verbal statements that are self- 
descriptive are perhaps the most common, and 
the general procedures by which the socializing 
community teaches an individual to describe 
his own overt behavior would not seem to differ 
fundamentally from the methods used to teach 
him to describe other events in his environ- 
ment. The community, however, faces a unique 
problem in training the individual to make 
statements describing internal stimuli to which 
only he has direct access, for the conditioning 
of the appropriate verbal responses must neces- 
sarily be based on the public stimuli and re- 
sponses that often accompany or resemble these 
private events. Skinner (1953, 1957) has pro- 
vided a detailed analysis of the limited resources 
available to the community for training its 
members thus to "know themselves" and he 
has described the inescapable inadequacies of 
the resulting knowledge. 

One implication of Skinner's analysis is that 
many of the self-descriptive statements that 
appear to be exclusively under the discrimina- 
tive control of private stimulation may, in fact, 
remain under the control of the same public 
events which members of the community them- 
selves must use in "inferring" the individual's 
inner states. In our well-fed society, for exam- 
ple, it is not uncommon to find a man consult- 
ing his wrist watch to answer the question, 
"Are you hungry?" There is also direct experi- 
mental evidence that an individual relies on 
external cues for describing his emotional 
states (Schachter and Singer, 1962). Attitude 


* Reprinted from Journal of Experimental Social 
Psychology, 1965, 1, 199-218, with permission of the 
author and the publisher. 
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statements may be similarly controlled. For 
example, when the answer to the question, “Do 
you like brown bread?" is, "I guess I do, I'm 
always eating it," it seems unnecessary to in- 
voke a font of privileged selfknowledge to 
account for the reply. In this example, it is 
clear that the discriminative stimuli control- 
ling the attitude statement reside in the indi- 
vidual's overt behavior; indeed, the man's reply 
is functionally equivalent to the reply his wife 
might give for him: "I guess he does, he is 
always eating it." 

It is the major thesis of this report, then, 
that an individual's belief and attitude state- 
ments and the beliefs and attitudes that an 
outside observer would attribute to him are 
often functionally equivalent in that both sets 
of statements are "inferences" from the same 
evidence: the public events that the socializing 
community originally employed in training the 
individual to make such self-descriptive state- 
ments. The three experiments reported below 
provide support for this hypothesis by demon- 
strating that an individual's belief and attitude 
statements may be predicted and controlled by 
manipulating his overt behavior and the stimu- 
lus conditions under which it occurs in ways 
that would lead an outside observer to infer 
that the individual held the "belief" or "atti- 
tude" we wish to obtain. The individual, in 
short, is regarded as an observer of his own 
behavior and its controlling variables; accord- 
ingly, his belief and attitude statements are 
viewed as "inferences" from his observations. 


TACTS, MANDS, AND COMMUNICATOR 
CREDIBILITY 


A descriptive statement, a verbal response 
that is under the discriminative control of some 
portion of the environment, is classified as a 
"tact" (Skinner, 1957). A speaker is trained to 
describe or "tact" his environment for the 
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benefit of his listeners who provide generalized 
social reinforcement in return. An individual's 
belief and attitude statements are often tacts 
of stimuli arising from himself (eg. "I am 
hungry"), his behavior (e.g., “I am generous"), 
or the effects of stimuli on him (e.g., “It gives 
me goosepimples”). Attitude statements in 
particular have the properties of tacts of the 
reinforcing effects of a stimulus situation on 
the individual (e.g, "I detest rainy weather,” 
“I'd walk a mile for a Camel”). 

Verbal responses that are under the control 
of specific reinforcing contingencies are called 
“mands.” A speaker who emits a mand is ask- 
ing for, requesting, or “manding” a particular 
reinforcer (cf. demands, commands). Only a 
characteristic consequence will serve to rein- 
force the response, and often this reinforcer is 
specified explicitly by the response (e.g., “Please 
pass the milk”). Mands need not be verbal in 
the usual sense; for example, pointing to the 
milk pitcher may be functionally equivalent 
to the vocal request. Mands are often disguised 
as tacts as in “I believe you have the sports 
page” or as in the case of the television an- 
nouncer who praises a product he is selling; 
his verbal behavior is a mand for the salary 
he receives and may not at all be under the 
actual discriminative control of the features 
of the product he appears to be tacting. A lie 
is often a mand for escape from aversive conse- 
quences; it, too, is a mand disguised as a tact. 
Any particular verbal response, then, may have 
both mand and tact characteristics in differing 
degrees. Thus, until the controlling circum- 
stances are specified, it is not possible to deter- 
mine the functional classification of a remark 
like “Darling, you look beautiful tonight”; the 
probabilities are high that it is a subtle blend 
of mand and tact. 

It is clear, then, that in attempting to infer 
a speakers "true" beliefs and attitudes, the 
listener must often discriminate the mand-tact 
characteristics of the communication. This is, 
in fact, an important dimension of "communi- 
cator credibility." A communicator is credible to 
the extent that his communication is discrimi- 
nated as a set of tacts, and his credibility is 
vitiated to the extent that he appears to be 
manding in the form of disguised tacts. Thus, 
a communication attributed to J. Robert Op- 
penheimer is more persuasive than the same 
communication attributed to Pravda (Hovland 
and Weiss, 1951); the white coat and stetho- 
scope on the television announcer are intended 


to indicate to the viewer that the announcer 
is one whose verbal behavior is under discrimi- 
native control of the product, not one who is 
manding money. Not only is a credible com- 
municator more likely to persuade his listeners, 
but to the extent that his verbal responses ap- 
pear to be "pure" tacts, they will be judged, 
by definition, to be his own "true" beliefs and 
attitudes. We turn now to evidence that the 
beliefs and attitudes of the communicator him- 
self may be viewed as self-judgments based 
partially upon his credibility as a communi- 
cator; and, to the extent that this is so, they 
will coincide with judgments of his beliefs and 
attitudes that outside observers would make. 

In an experiment by Festinger and Carlsmith 
(1959), two experimental groups of 20 under- 
graduates were employed as subjects. In the 
$1 condition, the subject was first required to 
perform long repetitive laboratory tasks. He 
was then hired by the experimenter as an 
"assistant" and paid one dollar to tell a waiting 
fellow student that the tasks were enjoyable 
and interesting. In the $20 condition, the sub- 
jects were hired for $20 to do the same thing. 
A panel of judges, in a blind rating procedure, 
rated the $20 persuasive communications as 
slightly but insignificantly more persuasive than 
$1 communications. Attitude measurement 
showed that subjects paid $1 evaluated the 
tasks and the experiment more favorably than 
did $20 subjects. We may interpret these find- 
ings within the present framework by consid- 
ering the viewpoint of an outside observer who 
hears the individual making favorable state- 
ments about the tasks, and who further knows 
that the individual was paid $1 or $20 to en- 
gage in this behavior. When asked to judge 
the "true" attitude of the communicator, an 
outside observer would almost certainly judge 
a $20 communication to be a mand, behavior 
not at all under the control of the actual fea- 
tures of the laboratory task the individual 
appears to be tacting. Although a $1 communi- 
cation also has mand properties, an outside 
observer would be more likely to judge it than 
the $20 communication to be a set of tacts, and 
hence, by definition, to be the "true" attitudes 
of the individual. If one places our hypotheti- 
cal outside observer and the communicator in 
the same skin, the findings obtained by Festin- 
ger and Carlsmith are the result. 

Blind evaluations of the persuasive commu- 
nications were an elegant control feature of 
the Festinger-Carlsmith design, and, as men- 
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tioned, showed $1 communications to be no 
more persuasive than $20 communications 
when the mand-tact conditions under which 
the verbal behavior was emitted were not avail- 
able to the observer. But this is precisely the 
information which makes one communicator 
more credible than another. 

Cohen (Brehm and Cohen, 1962, p. 73) 
performed an experiment similar to the Festin- 
ger-Carlsmith study in order to rule out the in- 
terpretation that the $20 payment in the latter 
study was so large that it engendered suspicion 
and resistance, leading subjects to think, “It 
must be bad if they're paying me so much for 
it." (See also, Rosenberg, 1965). Since this alter- 
native interpretation is not unlike the mand- 
tact conceptualization offered here, it is 
relevant to examine Cohen's subsequent experi- 
ment in some detail. 

Cohen's subjects were offered $.50, $1, $5, or 
$10 to write an essay against their initial opin- 
ions on a current issue. The post-essay belief 
statements essentially duplicated the Festinger- 
Carlson results: the higher the inducement, the 
less the belief statement coincided with the 
view advocated in the essays. (The $5 and $10 
conditions did not differ significantly from the 
control group who were simply asked their 
opinions on the issue.) The crux of Cohen's 
arguments resides in the fact that significant 
differences in  postessay belief statements 
emerged between the $.50 and the $1 condi- 
tions, and between them and the control con- 
dition. Since these payments were small and 
close to one another, Cohen's argument im- 
plies, the mand-tact discrimination could not 
account for the results. 

This is, of course, an empirical question. 
The following study was designed to answer it 
by demonstrating that the belief statements 
made by Cohen's subjects when they were asked 
for their “true” opinions may be viewed as 
judgments based on the mand-tact characteris- 
tics of their own behavior, that is, on their 
own credibility as communicators. 


EXPERIMENT I. AN INTERPERSONAL 
REPLICATION OF THE ESSAY STUDY 


If the suggested interpretation of Cohen's 
results is correct, then an external observer 


NOT  VERYLITTLE LITTLE SOMEWHAT QUITE 


should be able to replicate the true belief state- 
ment of one of Cohen’s subjects with an inter- 
personal judgment if this observer is told the 
behavior of the subject and the apparent con- 
trolling circumstances of that behavior. 

The subjects in the present study thus served 
as external observers; each subject judged one 
—and only one—volunteer in one of Cohen's 
experimental conditions. 


Method 


Sixty undergraduates were randomly assigned 
to “$.50,” "$1," and “control” conditions. The 
first two groups were given the following in- 
structions on a single sheet of paper; it consists 
of a description of the experimental situation 
employed by Cohen: 


In the Spring of 1959, there was a s udent 
“riot” at Yale University in which the New 
Haven Police intervened with resulting accusa- 
tions of police brutality toward the students. 
The issue was a very bitter and emotional one, 
and a survey of student opinion showed most 
of the student body to be extremely negative 
toward the police and their actions and sym- 
pathetic toward the students. 

As part of a research project, a student mem- 
ber of a research team from the Institute of 
Human Relations at Yale selected a student 
at random and asked him to write a strong, 
forceful essay entitled, “Why the New Haven 
Police Actions Were Justified,” an essay which 
was to be unequivocally in favor of the police 
side of the riots. The decision to write such 
an essay or not was entirely up to the student, 
and he was told that he would be paid the 
sum of $.50 [$1.00] if he would be willing to 
do so. The student who was asked agreed to 
do so, and wrote such an essay. 

The scale shown below was used in the 
original poll of student opinion on the issue. 
From this description, estimate as well as you 
can the actual opinion of the student who was 
willing to write the essay. Indicate your esti- 
mate by drawing a line through the appropriate 
point on the scale. 

“Considering the circumstances, how justified 
do you think the New Haven police actions 
were in the recent riot?” 
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AT ALL 
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The control Ss in the present study received 
the same instructions except that the entire 
second paragraph was deleted and the third 
paragraph was altered to read: “From this de- 
scription, estimate as well as you can the actual 
opinion of a student selected at random on 
the Yale campus. . . .” 

In Cohen's experiment the subjects first wrote 
the essay and were then asked to indicate their 
own opinions on the scale. The scale employed 
in the present study is identical to Cohen's. 
The "control" condition in the present study 
also provides a check on the adequacy with 
which the situation on the Yale campus has 
been described. 


Results and Discussion 


Figure 1 shows the interpersonal judgments 
of Ss in the present experiment compared to 
the belief statements collected by Cohen. Two- 
tailed probability levels based on ¢ tests are 
also shown. It is seen that Cohen's results are 
closely replicated. 

These results show the mand-tact interpre- 
tation of the general inverse relation between 
amount of payment and subsequent belief state- 
ments is still viable: the lower the payment, 
the less the mand properties of the observed 
behavior predominate in the eyes of both intra- 
and inter-personal observers. To the extent 


"Actual" beliefs (Cohen) 


p «05 


Somewhat 
un 


that the behavior has non-mand properties, it 
will be discriminated by an observer (including 
the individual himself) as indicating the "true" 
beliefs of that individual. 

It should be noted that Ss in the experiment 
just presented did not actually read any of the 
essays written by Cohen's Ss. The successful 
interpersonal replication of his results, then, 
suggests that it may not have been necessary 
for his $s to write the essays; that is, the be- 
havior of volunteering may be the source of 
discriminative control over the beliefs. The 
"communication" may be contained in the com- 
mitment to write the essay, and it is the 
commitment that has the crucial mand or non- 
mand properties depending upon the payment 
offered. Brehm and Cohen (1962, pp. 115-116) 
cite a number of studies which, indeed, do 
demonstrate that commitment alone is suffi- 
cient; the essays themselves do not have to be 
written. In order to provide direct support for 
our interpretation that the individual's subse- 
quent belief statement in such experiments 
may be viewed as self-judgments based on his 
behavior of volunteering and its mand charac- 
teristics, it seemed desirable to attempt an inter- 
personal replication of a study in which 
persuasive communication per se did not play - 
a part. The study chosen is of additional inter- 
est because the dependent variable is a judg- 
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Figure 1. A comparison of “actual” beliefs and interpersonal judgments of belief. 
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ment of hunger, a tact that is not only 
self-descriptive, but is typically considered to 
be under the exclusive control of private in- 
ternal stimuli. 


EXPERIMENT II. AN INTERPERSONAL 
REPLICATION OF THE HUNGER STUDY 


Volunteer male Ss in an experiment by 
Brehm (Brehm and Cohen, 1962, pp. 133-136) 
were required to go without food for a num- 
ber of hours before an individual experimental 
session disguised as an investigation on hunger 
and intellectual performance. At the beginning 
of the session, the S indicated on a question- 
naire scale how hungry he felt. After engaging 
in a number of tasks, the $ was asked to volun- 
teer for further testing, which would entail 
continued food deprivation. Half of the Ss 
were offered $5 for volunteering; others were 
offered nothing. A second self-evaluation of 
hunger, administered after the $ had volun- 
teered, showed that those offered the money 
rated themselves significantly more hungry 
than those who had volunteered for nothing. 

As before, we interpret these results by assum- 
ing the standpoint of an outside observer who 
sees the S volunteering, and is then asked to 
judge the volunteer's hunger. If the observer 


The design of the present experiment is the 
same as the interpersonal replication of the 
essay study; each S in the present experiment 
judges only one “other.” 


Method 

Fifty male college students were randomly 
assigned to a "$5" or a "no-pay" condition. 
Those in the "$5" condition were given the 
following instructions on a single sheet of paper. 


Volunteer male subjects in a psychology ex- 
periment on the effect of hunger on motor and 
intellectual functioning were required to go 
without food from the time they got up in 
the morning until the afternoon testing session. 
The volunteers received experimental credit 
points in their introductory psychology course 
for this experiment. 

When the subject arrived, he was shown the 
sandwiches, cookies, and milk on the testing 
table and told he would be able to eat as much 
of these as he wanted after the testing session. 
The subject then indicated how hungry he 
was on the rating scale shown below; the arrow 
shows the average degree of hunger for all the 
subjects who participated in this research at 
the beginning of this session. 

"How hungry are you?" 


broth eon UU TTL T9 NP B. oma ya o m ione ets | 
NOTAT VERY SLIGHTLY MODER- QUITE VERY EXTREMELY 
ALL SLIGHTLY ATELY 


were to see an $ volunteering for the sum of 
$5, he would discriminate the mand properties 
of that behavior, which is to say that he would 
be able to conclude very little about the S's 
hunger; the internal discriminative stimuli 
arising from "hunger," the observer would as- 
sume, would not be a controlling variable of 
the decision to volunteer. If the observer had 
been told the volunteer's initial hunger rating, 
he would probably conclude that the S had 
simply increased his hunger somewhat over the 
intervening time. The behavior of the unpaid 
volunteer, however, has no obvious mand prop- 
erties; specific monetary reinforcement is not 
a controlling variable. Hence, by volunteering, 
the $ tells the outside observer (and himself) 
that he is not very hungry; his volunteering, 
relative of course to Ss paid $5, is not a mand. 

If this interpretation of the Brehm results 
is correct, then external observers should be 
able to replicate with interpersonal judgments 
the self-ratings of hunger obtained by Brehm. 


After giving this rating, the subject engaged 
in a number of motor and intellectual tasks. 
One of the subjects was then asked if he would 
be willing to return for further testing in the 
evening; this, it was explained, would require 
him to continue going without food until about 
eight or nine that evening. He was told, “Un- 
fortunately, we cannot give you any more ex- 
perimental credit points for the evening testing 
session. However, we can pay anyone willing 
to come back since we do need your help. The 
amount is $5.00." 

This individual agreed to continue his par- 
ticipation. From this description, try to esti- 
mate how hungry this particular person. must 
have been at the end of the first session just 
described. Indicate your estimate by drawing 
a line through the appropriate point on the 
scale above. 


Subjects in a "no-pay" condition received 
identical instructions except that the reported 


——————— 
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offer of $5 was replaced by the following: “You 
get your point for having taken part this after- 
noon so don't think you have to come again 
this evening. But we would appreciate your 
help. If, however, you don't feel like doing it, 
that's okay because we can get someone else 
to do it." 

The scale and reported instructions to the 
volunteers are identical to those employed by 
Brehm. The arrow given on the present instruc- 
tion sheet represents the initial hunger rating 
of Brehm's $5 subjects. 


Results and Discussion 


Figure 2 shows the interpersonal judgments 
of Ss in the present experiment compared to 
the self-rating data collected by Brehm. 
(Brehm's result for the no-pay condition has 
been expressed as a displacement from the ar- 
row rather than from the actual initial hunger 
rating of his no-pay Ss.) The separation between 
the results of the treatments in Brehm's study 
was statistically significant at the .01 level; the 
separation in the interpersonal data is signifi- 
cant at the .025 level (two-tailed t test). It is 
clear that Brehm's results for intrapersonal 
judgments are closely replicated by the results 
for interpersonal judgments in the present 
study. 

We conclude that the self-ratings of hunger 
obtained by Brehm were partially under the 


Self-judgements of hunger (Brehm) 


UA 


No-pay 


control of the mand characteristics of the be- 
havior of volunteering, and that the judgments 
made by Brehm's subjects vis-a-vis their own 
behavior does not differ significantly from judg- 
ments based on their behavior made by an 
external observer. 


Alternative Manipulations of the Self- 
Credibility Parameter 


The crux of the present interpretation is 
that the control over beliefs exercised by an 
individual's own behavior will vary as the con- 
uugencies of specific reinforcement for engag- 
ing in the behavior are made more or less 
prominent as a controlling variable. As the 
contingencies of reinforcement are made more 
subtle, less discriminable, the mand properties 
of the $'s own behavior will be lessened and 
the behavior will, thereby, have maximum con- 
trol over his subsequent beliefs. In the studies 
discussed in detail so far, this self-credibility 
discrimination has been manipulated by vary- 
ing the magnitude of reinforcement offered for 
engaging in the overt behavior: the larger the 
reinforcement, the more likely it was that the 
S would discriminate his own behavior as a 
mand. We turn now to studies which employed 
a number of different stimulus operations, all 
with the effect of manipulating the mand prop- 
erties of the induced behavior. 

In an experiment by Aronson and Carlsmith 


leececcccelvcececcecieccecceoc|cccoccccelecccecess|ecceccceei 


Not Very 
at all slightly 


| Interpersonal judgements of hunger 


Slightly l1 Quite Very 


Extremely 


Figure 2. A comparison of self-judgments and interpersonal judgments of hunger. 
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(1963), school children approximately four 
years of age gave preference rank-orderings on 
a set of five toys. The experimenter then told 
the subject that he could play with any of the 
toys but one while the experimenter ran an 
errand. The forbidden toy was the one which 
ranked second in the child's preference order- 
ing. Children assigned to a mild-threat condi- 
tion were further told that the experimenter 
would be annoyed if the child played with the 
forbidden toy; children assigned to a strong- 
threat condition were told that if they dis- 
obeyed, the experimenter would be very angry, 
take all of the toys away, never come back, 
and think that the child was "just a baby." 
The experimenter then left the room and ob- 
served the child through a one-way mirror for 
ten minutes. None of the subjects played with 
the forbidden toy. After this interval, prefer- 
ence orderings on the five toys were again ob- 
tained from the subject. The final preference 
ranking of the forbidden toy was the depen- 
dent variable under investigation. It was found 
that subjects in the mild-threat condition as- 
signed significantly lower final preference rank- 
ings to the toy than did children in the 
strong-threat condition. Rankings obtained 45 
days later still showed some evidence of the 
differential effects of the treatments. 

In interpreting these results, it is again in- 
structive to assume the standpoint of an outside 
observer who sees the child playing with all 
the toys but one; in addition, we assume that 
the observer has been told that the child was 
given a mild [strong] threat of aversive conse- 
quences if he played with the toy. The outside 
observer is now asked to estimate the child's 
ranking of the forbidden toy. We may infer 
that the behavior of avoiding the toy on the 
part of subjects in the mild-threat condition 
has less the property of a mand (for withhold- 
ing aversive consequences) than the behavior 
of subjects in the strong-threat condition. Thus, 
a hypothetical observer of a subject in the 
mild-threat condition would be more likely to 
assume that the actual reinforcing properties 
of the toy were partially controlling the ob- 
served behavior than would an observer judg- 
ing the behavior of a subject in the strong- 
threat condition. The former observer would, 
then, assign a lower preference rating to the 
toy than the latter observer. These are the ob- 
tained findings, which are, then, consistent with 
the interpretation that the final ranking of the 
toy given by the child himself is primarily a 


self-judgment based on his own behavior and 
its controlling variables, rather than a tact of 
the reinforcing properties of the toy being 
ranked. 

The studies which have been discussed so 
far are among several "forced compliance" 
experiments conducted within the framework 
of Festinger's theory of cognitive dissonance 
(1957). Each of these studies yields data con- 
sistent with the present analysis: when the 
operations employed are submitted to a care- 
ful analysis in terms of the discriminative 
stimuli presented to the subject, then those 
conditions in which specific reinforcing con- 
tingencies are enhanced or made more promi- 
nent (the mand conditions) are always found 
to yield weaker control over the subsequent 
belief statements than those conditions in which 
the subject is given "no good reason" for en- 
gaging in the behavior. For example, other 
experiments have manipulated the degree of 
choice permitted the subject, degree of com- 
mitment evoked, amount of justification given, 
and the reinforcing properties of the experi- 
menter—all with the same results (Brehm and 
Cohen, 1962, pp. 303-306): when the contin- 
gencies of reinforcement are made less signifi- 
cant or more subtle, the non-mand properties 
of the behavior predominate and subsequent 
belief statements may be predicted by assuming 
them to be the inferences of an observer of 
the behavior. 

Other studies from the literature are also 
amenable to the self-credibility interpretation. 
King and Janis (1956) compared improvised 
versus non-improvised role playing in produc- 
ing belief change and found that subjects who 
read a communication and then played the role 
of a sincere advocate of the communication's 
point of view changed their beliefs more than 
those who read the fully prepared script aloud. 
It seems likely that the subjects who simply 
read the script clearly discriminated that their 
behavior was under the control of the text (cf. 
Skinner, 1957), whereas in the case of the sub- 
ject who actually played out the role, every 
measure possible was taken to insure that he 
was surrounded by stimuli that would charac 
teristically control the behaviors of an individ- 
ual who "believed" the point of view. It should 
also be noted that subjects who were permitted 
to “improvise” would be expected to select pre 
cisely those arguments that had initial tact 
properties for themselves. It seems consonant 
with our analysis, then, to find that these sub- 
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jects were more “persuasive” to themselves than 
the script readers. 

In an experiment by Scott (1957), pairs of 
students were asked to debate different issues 
in front of a class; each student argued the 
position contrary to his initial beliefs. The ex- 
perimenter manipulated the conditions so that 
a predetermined member of each pair “won” 
the debate. Later measurement of belief showed 
that the “winners” changed in the direction of 
the position that they had defended in the de- 
bate, but "losers" did not; in fact, "losers" 
shifted slightly toward the “winning” side. In 
a replication of the experiment in which a 
panel of judges rather than a class vote "de- 
cided" the winner of the debates, some of the 
debaters defended positions they actually held 
(Scott, 1959). Among these subjects, "winners" 
shifted still further in the direction of their 
arguments, and "losers" again shifted slightly in 
the direction of the "winning" side, a shift that 
was against the arguments they themselves pre- 
sented and their initial beliefs. 

Since the winners had been reinforced by 
winning and the losers had not, Scott interprets 
these results as showing that the reinforcement 
of overt verbal responses leads to a change in 
the beliefs. Such an interpretation, however, 
runs into difficulty in accounting for the results 
of the dissonance experiments previously de- 
scribed, which show an inverse relation between 
reinforcement and belief change. The present 
analysis suggests that the (falsified) class vote or 
judge's decision gave an additional tact prop- 
erty, or measure of credibility, to the beliefs 
stated in the "winning" argument; hence, the 
“winner” himself discriminated these beliefs as 
tacts, which is to say as his “true” beliefs. The 
fact that “losers” also shifted slightly toward 
the winning side is consistent with this inter- 
pretation, Further support for this interpreta- 
tion comes from a study by Kelley and 
Woodruff (1956). College students who heard a 
recording of "prestigeful" members of their 
college group applaud a speech that opposed 
their initial beliefs changed their beliefs more 
than did a control group that was told that the 
applauding audience was a group of towns 
people. These subjects thus based their credi- 
bility discriminations of the communicator on 
the judgment of other listeners, just as the de- 
baters in the debate studies, it is suggested, 
based the credibility of their own communica- 
tions on the decisions of the judges. The self- 
perception interpretation is thus consistent with 


the data from both the debate experiments and 
the dissonance theory experiments. 


EXPERIMENT III. SELF-CREDIBILITY AND 
THE CONTROL OF ATTITUDE STATEMENTS 


The stimulus operations that we have inter- 
preted as controlling self-credibility discrimina- 
tions in all the experiments discussed and 
replicated so far have always had other func- 
tional properties. Thus, money payoffs and the 
winning of debates also have reinforcing prop- 
erties; manipulations of justification, choice, 
commitment, etc. involve a veritable tangle of 
complex and ambiguous stimulus operations. 
The present experiment was designed to pro- 
vide more direct support for the present for- 
mulation by "raising stimuli from birth" in the 
laboratory that would have no functional prop- 
erties other than those we have imputed to the 
stimulus operations in the other experiments 
discussed. 

The dependent variable in this study is an 
attitude statement concerning the "funniness" 
of magazine cartoons. The cartoons employed 
are those that the $ has previously marked as 
"neutral"; that is, neither funny nor unfunny. 
During the experimental session, which is dis- 
guised as a tape recording session for the prep- 
aration of experimental stimulus materials, the 
S is required to state that each cartoon is either 
"very funny" or "very unfunny." Each overt 
statement is made in the presence of a visual 
stimulus (a colored light) that has been pre- 
viously paired with the $'s verbal behavior: one 
of the colored lights, which we shall call the 
truth light, had always been illuminated pre- 
viously when the $ was giving true answers to 
a set of questions concerning personal informa- 
tion. The other light was always illuminated 
when the S was giving false answers to the ques- 
tions; we shall call this light the lie light. After 
the $ has made an overt statement about the 
cartoon, he rates his "true" attitude on a rating 
scale. 

The prediction is that the S's "true" attitude 
will be controlled by his overt statements to a 
greater degree when the statement is emitted in 
the presence of the truth light than when it is 
emitted in the presence of the lie light. A post- 
experimental questionnaire tests for the S's 
"awareness" of any of the obtained effects. It 
will be noted that each S is his own control, 
and that each S provides a complete replication 
of the experiment. 
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Method 

Eight college students were hired for two 
forty-minute sessions on consecutive days to 
“help prepare materials for use in experimental 
research on voice-judgment.” In the first session, 
the S$ ranked each of 200 cartoons for “funni- 
ness” on the 100 millimeter scale shown below 
by drawing a short vertical line across the scale 
line at any appropriate point. 


eee | 
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UNFUNNY 


Cartoon W-2 UNFUNNY 


The cartoons, arranged two to a page in a 
loose-leaf notebook, were taken from The New 
Yorker, The Saturday Evening Post, Look, and 
McCalls magazines. This session, then, provided 
a baseline measure for the dependent variable, 
ratings of funniness of cartoons. 

In the experimental session the following 
day, the S was seated at a desk in a small acous- 
tically-tiled recording room for the alleged pur- 
pose of making a tape recording of his voice 
for the voice-judgment experiments. The $'s 
first task was to fill out a form with 49 ques- 
tions concerning personal information. ‘The in- 
structions and sample items from this form are 
reproduced below: 


This information form will provide some of 
the materials you will be recording on tape for 
the voice-judgment experiment. It should be 
filled out completely and accurately. THIS IN- 
FORMATION WILL REMAIN CONFIDEN- 
TIAL AND ANONYMOUS. YOUR NAME 
WILL NOT APPEAR ON THE TAPE OR 
ELSEWHERE IN THE EXPERIMENT. 


(1) First Name 
(10) Are you generally favorable or unfavor- 
able to fraternities and sororities? 
(22) Do you feel that full socialism would be 
preferable to the free enterprise system 
in America? 
(29) Do you believe in a Supreme Be- 
ing? 


After obtaining the completed information 
form, E left the recording room, and all further 
communication with the S was conducted with 
an intercom. The following training procedure 
was then employed to establish two lights as 
discriminative stimuli which would indicate 
that verbal behavior in the presence of the one 
was tacting ("truth-telling") and in the presence 
of the other, lying, a form of behavior which is 


characteristically a mand. The following in- 
structions were given to the $:! 


As I mentioned, you are going to be making 
a tape of your own voice to be used in some 
research we will be doing on an individual's 
ability to judge another person's voice. In par- 
ticular, we are going to be examining an indi- 
vidual's ability to judge whether the speaker 


| 
FUNNY VERY 
FUNNY 


on the tape is telling the truth or not. To do 
this, some of the things you will say on the tape 
will be true statements; others will be untrue. 
The procedure will be as follows: I will ask you 
questions, one at a time, from the list of infor- 
mation you just filled out. After I ask you a 
question, I will start the tape recorder, and you 
should answer the question into the microphone 
in front of you. Whenever I turn on the tape 
recorder, one of two colored light bulbs in the 
ceiling fixture will also go on automatically. 
If the amber light goes on [amber light turned 
on], you are to answer the question truthfully; 
if, however, the green light turns on [green light 
turned on; amber light turned off], you should 
make up an untrue answer and speak it into 
the microphone as convincingly and as natu- 
rally as possible. My questions will not be re- 
corded on the tape, so your answers must be 
complete statements, not just single word an- 
swers, For example, I will ask: “What is your 
first name?” When the ceiling light goes on, you 
should answer, “My first name is such-and-such." 
If the light is amber, then you would, of course, 
give your real first name. If the light is green, 
you would make up some other name. As you 
can see, we wanted this to be spontaneous, 
which is why you will not know until the tape 
actually starts whether you are going to give à 
true or an untrue answer; you have to be on 
your toes. The lighting circuitry is set to select 
the two colored lights automatically and in ran- 
dom sequence. I will be checking your responses 
on your information form; when you respond 
in the appropriate way, I will stop the tape, 
the colored light will go out, and we will pro- 
ceed to the next question. If you happen to 
make a mistake, or do not answer with a com- 


1 Instructions were designed to sound matter-of- 
fact so that it would not be apparent to S$ that they 
were being read verbatim. Hence, in written form 
they seem wordy and ungrammatical. 
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plete sentence we will repeat that item. Are 
there any questions? [pause] Okay, remember 
the amber light means you are to give a true 
answer; the green light, an untrue one. 


The training procedure then proceeded as 
described. Half of the questions required untrue 
responses, and half required true responses. The 
two lights were reversed for some Ss: green light 
for true responses, amber light for false re- 
sponses. At the end of the training session, E 
continued as follows: 


We have now completed all the questions on 
the information form. In the second part of our 
voice-judgment experiments, the subjects will 
be asked to look at some of the cartoons you 


VERY 
UNFUNNY 


judged earlier. The cartoons we have decided 
to use have been placed in the notebook in 
front of you. The subjects will be asked to give 
their opinions of each cartoon on a rating scale 
like the one on the notebook cover by drawing 
a line through the scale line at the appropriate 
point. You will note that this scale is less de- 
tailed than the ones you used earlier. [See scale 
below.] Before judging each cartoon, the sub- 
ject will listen to a comment made by you on 
the tape stating that you found the cartoon 
very funny or very unfunny. We are interested 
in seeing whether your comment can influence 
the subject's opinion of the cartoon in the di- 
rection of your own opinion. Our procedure 
now, then, will be as follows: You will glance at 
each cartoon and decide whether it is funny 
or unfunny, that is, whether in your opinion it 
falls to the right or to the left of the neutral 
point on the scale. When you decide, tell me 
your decision and I will start the tape recorder. 
If you think the cartoon is a funny one, you 
should then record the comment, "I think car- 
toon such-and-such is a very funny cartoon." If 
the cartoon lies on the left side of the scale in 
your opinion, you should record the comment, 
"I think cartoon such-and-such is a very un- 
funny cartoon." In every case you should iden- 
tify the cartoon by its code letter and say that 


it is either very funny or very unfunny, which- 
ever is closer to your own opinion. Aside from 
that, however, you should use whatever words 
seem most natural and convincing to you, such 
as "I find cartoon such-and-such to be etc.” or 
“I think cartoon such-and-such is . . . etc." Are 
there any questions? [pause] Just so you will 
know when I turn the tape recorder on and 
off, I will leave the ceiling recording lights 
hooked up to the tape recorder; so, the two 
colored lights will continue to flash on and off 
with the recorder in random sequence. Okay, 
glance at the first cartoon and let me know 
which half of the scale it falls in, and I will 
start the recorder for your comment. 


The following scale was used in this session: 


VERY 
FUNNY 


At this point, $ glanced at the first cartoon, 
and indicated his binary decision. E then turned 
on the lie light; $ made his comment, the col- 
ored light went out, and the white desk light 
went on simultaneously. E continued as follows: 


It is also necessary that we have a record of 
your exact opinion of the cartoon at the time 
the comment was recorded. For this reason, you 
should now turn the page and mark your exact 
opinion of the cartoon on the scale provided by 
placing a mark through the scale line at the 
appropriate point. [E waited for this to be 
done.] Okay, turn to the second cartoon and 
tell me which half of the scale it falls in. 


For each S, the twenty cartoons employed in 
this session had been ranked within ten milli- 
meters of the neutral point (that is, between 
SLIGHTLY FUNNY and SLIGHTLY UN- 
FUNNY), on the scale presented in the pre- 
experimental session. The cartoons were further 
matched and alternated in the sequence of pre- 
sentation to control for the smaller variations 
around the neutral point. (This precaution was 
probably unnecessary because of the high vari- 
ability of such rankings over time.) 

The following sequence of lights was used 
for the 20 cartoons: 


l. Lie 5. Truth 9. Lie 13. Truth 17. Lie 
2. Truth 6. Lie 10. Truth 14. Lie 18. Truth 
3. Truth 7. Lie 11. Truth 15. Lie 19. Truth 
4. Lie 8. Truth 12. Lie 16. Truth 20. Lie 
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After the $ followed the procedure for the 
first cartoon, E did not communicate further 
with him except to announce the code number 
of the next cartoon after the $ had completed 
his ranking of the previous one. 

This procedure, then, assessed the control of 
attitudes exercised by overt verbal behavior 
emitted in the presence of two discriminative 
stimuli, one of which had a history of pairing 
with true responses, the other with false re- 
sponses. In order to assess awareness of the 
effects, in the event that they occurred, each S 
was instructed to respond in writing to the fol- 
lowing four questions at the end of the session. 


Each of the attached sheets contains a ques- 
tion which will be helpful to us in revising our 
procedures for the voice-judgment experiment. 
Please do not look ahead to the next question 
until you finish the preceding one. 

As far as you can tell, did your ratings of the 
20 cartoons you looked at today change from 
your previous ratings of them? If so, in what 
way? 

As far as you can tell, were your cartoon rat- 
ings today affected in any way by the other 
procedures involved? If so, in what way? 

Did you pay attention to the color of the 
recording lights during the cartoon portion of 
the session? As far as you can tell, did the re- 
cording lights affect your cartoon judgments in 
any way? If so, in what ways? 


Each $ was then paid $2.50 and told the true 
purpose of the experiment. 


Results and Discussion 


The prediction is that the 10 cartoons com- 
mented upon by the $ in the presence of the 
truth light would be ranked further from the 
neutral point in the direction of the self-persua- 
sion ("very funny" or "very unfunny”) than the 
10 cartoons commented upon by the $ in the 
presence of the lie light. This one-tailed hy- 
pothesis is tested on each $ with the Mann- 
Whitney U test. 

Because separate analyses of attitudes toward 
the cartoons called "very funny" and "very un- 
funny" yielded comparable results, these data 
have been combined; the scores given in Table 
1 represent mean absolute deviation in milli- 
meters from the neutral point (0) in the direc- 
tion of the self-persuasion on the 100 millimeter 
scale. Accordingly, the scores can range from 0 


to 50, where a score of 50 represents a rating 
of either "very funny" or "very unfunny," de- 
pending upon the S's overt statement. The S's 
responses to the final awareness question are 
also displayed. 

It is seen in Table 1 that seven of the eight 
Ss were persuaded to a greater extent by com- 
ments made in the presence of the truth light 
than by comments made in the presence of the 
lie light. The null hypothesis (no difference be- 
tween attitude ratings affected by the truth 
versus lie treatments) is rejected by a sign test 
applied to the relative position of these ratings 
on the scale of each S (p = .035; one-tailed). In 
addition, the results from five of the Ss consid- 
ered individually attain a significance level 
smaller than 5% by a one-tailed Mann-Whitney 
U test. 

The reverse trend displayed by S 7 is due pri- 
marily to the final cartoon—a lie-light cartoon 
—which was marked at 48 (VERY FUNNY). 
On the awareness questionnaire she responded: 
“I didn't think any of them were too funny 
so toward the end maybe I tried to think that 
they were funny so I wouldn't seem so un- 
happy." 

In pretesting the experimental procedures, a 
foreign student (Chinese undergraduate) served 
as an S. A slightly different procedure was em- 
ployed with him in that only twelve cartoons, 
which he had previously marked as neutral, 
were employed, and Æ dictated the comment 
which was to be made for each cartoon rather 
than first obtaining the S's preference. For this 
reason, the attitude rating can be displaced from 
the neutral point in the negative direction, that 
is, in the direction away from the overt com- 
ment “very funny” or “very unfunny.” The 
mean deviation from the neutral point for the 
lie-light cartoons was —9; the mean deviation 
for the truth-light cartoons was +28, a differ- 
ence significant at the .004 level. In addition 
to indicating no awareness of the effect of the 
lights, he commented that “Some of the cartoons 
are not too obvious in their meanings. So it is 
quite hard to give them the correct ratings.” 
The powerful control exercised by the lights 
over the attitude statements of this S is thus 
interpreted as reflecting the effect of the weak 
stimulus control exercised by the cartoons them- 
selves (“stimulus ambiguity”). The control is 
even more striking because E dictated the com- 
ment which was to be made for each cartoon, 
rather than first obtaining the S's preference: 
this adds a strong mand property to the subse- 
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quent behavior. (It will be recalled that a ma- 
nipulation of "amount of choice" was cited 
earlier as a demonstrated manipulation of the 
self-credibility parameter.) In fact, when this 
same procedure was attempted with ten Amer- 
ican S’s, no effect of stimulus control by the 
lights was obtained. These interpretations are 
not, of course, unequivocal since the cultural 
differences between this $ and the American Ss 
would imply variation of many more param- 
eters than just the ambiguity of the cartoons. 

None of the Ss in the present experiment was 
able to identify the effect of his comments or 
the lights on his subsequent attitude statements. 
Again, it is evident that “self-awareness” is a 
set of behaviors which must be learned from a 
socializing community that sets up the neces- 
sary contingencies of reinforcement for estab- 
lishing the discriminations (Skinner, 1953). In 
the present experiment, there is no theoretical 
reason to suppose that an S's tact of the condi- 
tions controlling his behavior should be caus- 
ally relevant to the outcome of the experiment. 

In sum, it is concluded that the data from 
the experiments reported provide support for 
the hypotheses advanced at the outset: self- 
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49. The Functional Approach to the Study of Attitudes 


DANIEL KATZ 


EARLY APPROACHES TO THE STUDY OF 
ATTITUDE AND OPINION 


There have been two main streams of think- 
ing with respect to the determination of man’s 
attitudes. The one tradition assumes an irra- 
tional model of man: specifically it holds that 
men have very limited powers of reason and 
reflection, weak capacity to discriminate, only 
the most primitive self-insight, and very short 
memories. Whatever mental capacities people 
do possess are easily overwhelmed by emotional 
forces and appeals to self-interest and vanity. 
The early books on the psychology of advertis- 
ing, with their emphasis on the doctrine of sug- 
gestion, exemplify this approach. One expres- 
sion of this philosophy is in the propagandist's 
concern with tricks and traps to manipulate the 
public. A modern form of it appears in The 
Hidden Persuaders, or the use of subliminal 
and marginal suggestion, or the devices sup- 
posedly employed by “the Madison Avenue 
boys." Experiments to support this line of think- 
ing started with laboratory demonstrations of 
the power of hypnotic suggestion and were soon 
extended to show that people would change 
their attitudes in an uncritical manner under 
the influence of the prestige of authority and 
numbers. For example, individuals would ac- 
cept or reject the same idea depending upon 
whether it came from a positive or a negative 
prestige source.! 

The second approach is that of the ideologist 
who invokes a rational model of man. It assumes 
that the human being has a cerebral cortex, that 
he seeks understanding, that he consistently 
attempts to make sense of the world about him, 

." 


* Excerpted from Public Opinion Quarterly, 1960, 
24, 163-204, with permission of the’author and the 
publisher, 


1 Muzafer Sherif, The Psychology of Social Norms, 
New York, Harper, 1936. 


that he possesses discriminating and reasoning 
powers which will assert themselves over time, 
and that he is capable of self-criticism and self- 
insight. It relies heavily upon getting adequate 
information to people. Our educational system 
is based upon this rational model. The present 
emphasis upon the improvement of communi- 
cation, upon developing more adequate chan- 
nels of two-way communication, of conferences 
and institutes, upon bringing people together 
to interchange ideas, are all indications of the 
belief in the importance of intelligence and 
comprehension in the formation and change of 
men's opinions. 

Now either school of thought can point to 
evidence which supports its assumptions, and 
can make fairly damaging criticisms of its op- 
ponent. Solomon Asch and his colleagues, in 
attacking the irrational model, have called at- 
tention to the biased character of the old ex- 
periments on prestige suggestion which gave the 
subject little opportunity to demonstrate crit- 
ical thinking.? And further exploration of sub- 
jects in these stupid situations does indicate 
that they try to make sense of a nonsensical 
matter as far as possible. Though the same 
statement is presented by the experimenter to 
two groups, the first time as coming from a 
positive source and the second time as coming 
from a negative source, it is given a different 
meaning dependent upon the context in which 
it appears? Thus the experimental subject does 


2 Solomon E. Asch, Social Psychology, New York, 
Prentice-Hall, 1952. 

3 Ibid., pp. 426-427. The following statement was 
attributed to its rightful author, John Adams, for 
some subjects and to Karl Marx for others: “those 
who hold and those who are without property have 
ever formed distinct interests in society.” When the 
statement was attributed to Marx, this type of com- 
ment appeared: “Marx is stressing the need for a 
redistribution of wealth.” When it was attributed 
to Adams, this comment appeared: “This social 
division is innate in mankind.” 
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his best to give some rational meaning to the 
problem. On the other hand, a large body of 
experimental work indicates that there are 
many limitations in the rational approach in 
that people see their world in terms of their 
own needs, remember what they want to re- 
member, and interpret information on the basis 
of wishful thinking. H. H. Hyman and P. 
Sheatsley have demonstrated that these experi- 
mental results have direct relevance to informa- 
tion campaigns directed at influencing public 
opinion.* These authors assembled facts about 
such campaigns and showed conclusively that 
increasing the flow of information to people 
does not necessarily increase the knowledge ab- 
sorbed or produce the attitude changes desired. 

The major difficulty with these conflicting 
approaches is their lack of specification of the 
conditions under which men do act as the the- 
ory would predict. For the facts are that people 
do act at times as if they had been decorticated 
and at times with intelligence and comprehen- 
sion. And people themselves do recognize that 
on occasion they have behaved blindly, im- 
pulsively, and thoughtlessly. A second major 
difficulty is that the rationality-irrationality di- 
mension is not clearly defined. At the extremes 
it is easy to point to examples, as in the case 
of the acceptance of stupid suggestions under 
emotional stress on the one hand, or brilliant 
problem solving on the other; but this does not 
provide adequate guidance for the many cases 
in the middle of the scale where one attempts 
to discriminate between rationalization and 
reason. 


RECONCILIATION OF THE CONFLICT IN A 
FUNCTIONAL APPROACH 


The conflict between the rationality and ir- 
rationality models was saved from becoming a 
worthless debate because of the experimenta- 
tion and research suggested by these models. 
The findings of this research pointed toward 
the elements of truth in each approach and 
gave some indication of the conditions under 
which each model could make fairly accurate 
predictions. In general the irrational approach 
was at its best where the situation imposed 
heavy restrictions upon search behavior and re- 
sponse alternatives. Where individuals must 


* Herbert H. Hyman and Paul B. Sheatsley, "Some 
Reasons Why Information Campaigns Fail,” Public 
Opinion Quarterly, Vol. 11, 1947, pp. 413-423. 


give quick responses without adequate oppor- 
tunities to explore the nature of the problem, 
where there are very few response alternatives 
available to them, where their own deep emo- 
tional needs are aroused, they will in general 
react much as does the unthinking subject 
under hypnosis. On the other hand, where the 
individual can have more adequate commerce 
with the relevant environmental setting, where 
he has time to obtain more feedback from his 
reality testing, and where he has a number of 
realistic choices, his behavior will reflect the use 
of his rational faculties.5 The child will often 
respond to the directive of the parent not by 
implicit obedience but by testing out whether 
or not the parent really meant what he said. 

Many of the papers in this issue, which de- 
scribe research and theory concerning consis- 
tency and consonance, represent one outcome 
of the rationality model. The theory of psy- 
chological consonance, or cognitive balance, 
assumes that man attempts to reduce discrepan- 
cies in his beliefs, attitudes, and behavior by 
appropriate changes in these processes. While 
the emphasis here is upon consistency or log- 
icality, the theory deals with all dissonances, no 
matter how produced. Thus they could result 
from irrational factors of distorted perception 
and wishful thinking as well as from rational 
factors of realistic appraisal of a problem and 
an accurate estimate of its consequences. More- 
over, the theory would predict only that the 
individual will move to reduce dissonance, 
whether such movement is a good adjustment 
to the world or leads to the delusional systems 
of the paranoiac. In a sense, then, this theory 
would avoid the conflict between the old ap- 
proaches of the rational and the irrational man 
by not dealing with the specific antecedent 
causes of behavior or with the particular ways 
in which the individual solves his problems. 

In addition to the present preoccupation with 
the development of formal models concerned 
with cognitive balance and consonance, there 
is a growing interest in a more comprehensive 
framework for dealing with the complex vari- 


5 William A. Scott points out that in the area of 
international relations the incompleteness and re- 
moteness of the information and the lack of pres- 
sures on the individual to defend his views results 
in inconsistencies. Inconsistent elements with re- 
spect to a system of international beliefs may, how- 
ever, be consistent with the larger system of the 
personality. "Rationality and Non-rationality of In- 
ternational Attitudes," Journal of Conflict Resolu- 
tion, Vol. 2, 1958, pp. 9-16. 
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ables and for bringing order within the field. 
The thoughtful system of Ulf Himmelstrand, 
presented in the following pages, is one such 
attempt. Another point of departure is repre- 
sented by two groups of workers who have or- 
ganized their theories around the functions 
which attitudes perform for the personality. 
Sarnoff, Katz, and McClintock, in taking this 
functional approach, have given primary atten- 
tion to the motivational bases of attitudes and 
the processes of attitude change.9 The basic 
assumption of this group is that both attitude 
formation and attitude change must be under- 
stood in terms of the needs they serve and that, 
as these motivational processes differ, so too will 
the conditions and techniques for attitude 
change. Smith, Bruner, and White have also 
analyzed the different functions which attitudes 
perform for the personality." Both groups pre- 
sent essentially the same functions, but Smith, 
Bruner, and White give more attention to per- 
ceptual and cognitive processes and Sarnoff, 
Katz, and McClintock to the specific conditions 
of attitude change. 

The importance of the functional approach 
is threefold: 


l. Many previous studies of attitude change 
have dealt with factors which are not genuine 
psychological variables, for example, the effect 
on group prejudice of contact between two 
groups, or the exposure of a group of subjects 
to a communication in the mass media. Now 
contact serves different psychological functions 
for the individual and merely knowing that 
people have seen a movie or watched a televi- 
sion program tells us nothing about the per- 
sonal values engaged or not engaged by such a 
presentation. If, however, we can gear our 
research to the functions attitudes perform, we 
can develop some generalizations about human 
behavior. Dealing with nonfunctional variables 
makes such generalization difficult, if not im- 
possible. 

2. By concerning ourselves with the different 
functions attitudes can perform we can avoid 
the great error of oversimplification—the error 
of attributing a single cause to given types of 
attitude. It was once popular to ascribe radical- 


* Irving Sarnoff and Daniel Katz, "The Motiva- 
tional Bases of Attitude Change,” Journal of Ab- 
normal and Social Psychology, Vol. 49, 1954, 
pp. 115-124. 

7 M. Brewster Smith, Jerome S. Bruner, and Robert 
W. White, Opinions and Personality, New York, 
Wiley, 1956. 


ism in economic and political matters to the 
psychopathology of the insecure and to attribute 
conservatism to the rigidity of the mentally 
aged. At the present time it is common practice 
to see in attitudes of group prejudice the re- 
pressed hostilities stemming from childhood 
frustrations, though Hyman and Sheatsley have 
pointed out that prejudiced attitudes can serve 
as normative function of gaining acceptance in 
one’s own group as readily as releasing uncon- 
scious hatred.5 In short, not only are there a 
number of motivational forces to take into ac- 
count in considering attitudes and behavior, 
but the same attitude can have a different mo- 
tivational basis in different people. 

3. Finally, recognition of the complex moti- 
vational sources of behavior can help to remedy 
the neglect in general theories which lack speci- 
fication of conditions under which given types 
of attitude will change. Gestalt theory tells us, 
for example, that attitudes will change to give 
better cognitive organization to the psycholog- 
ical field. This theoretical generalization is sug- 
gestive, but to carry out significant research we 
need some middle-level concepts to bridge the 
gap between a high level of abstraction and 
particularistic or phenotypical events. We need 
concepts that will point toward the types of 
motive and methods of motive satisfaction. 
which are operative in bringing about cogni- 
tive reorganization. 


Before we attempt a detailed analysis of the 
four major functions which attitudes can serve, 
it is appropriate to consider the nature of atti- 
tudes, their dimensions, and their relations to 
other psychological structures and processes. 


NATURE OF ATTITUDES: THEIR 
DIMENSIONS 


Attitude is the predisposition of the individ- 
ual to evaluate some symbol or object or aspect 
of his world in a favorable or unfavorable man- 
ner. Opinion is the verbal expression of an 
attitude, but attitudes can also be expressed in 
nonverbal behavior. Attitudes include both the 
affective, or feeling core of liking or disliking, 
and the cognitive, or belief, elements which de- 
scribe the object of the attitude, its characteris- 


8 Herbert H. Hyman and Paul B. Sheatsley, “The 
Authoritarian Personality: A Methodological Cri- 
tique," in Richard Christie and Marie Jahoda, edi- 
tors, Studies im the Scope and Method of the 
Authoritarian Personality, Glencoe, Ill., Free Press, 
1954, pp. 50-122. 
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tics, and its relations to other objects. All atti- 
tudes thus include beliefs, but not all beliefs 
are attitudes. When specific attitudes are orga- 
nized into a hierarchical structure, they com- 
prise value systems. Thus a person may not only 
hold specific attitudes against deficit spending 
and unbalanced budgets but may also have a 
systematic organization of such beliefs and atti- 
tudes in the form of a value system of economic 
conservatism. 

'The dimensions of attitudes can be stated 
more precisely if the above distinctions between 
beliefs and feelings and attitudes and value sys- 
tems are kept in mind. The intensity of an 
attitude refers to the strength of the affective 
component. In fact, rating scales and even 
Thurstone scales deal primarily with the inten- 
sity of feeling of the individual for or against 
some social object. The cognitive, or belief, 
component suggests two additional dimensions, 
the specificity or generality of the attitude and 
the degree of differentiation of the beliefs. 
Differentiation refers to the number of beliefs 
or cognitive items contained in the attitude, 
and the general assumption is that the simpler 
the attitude in cognitive structure the easier it 
is to change? For simple structures there is no 
defense in depth, and once a single item of be- 
lief has been changed the attitude will change. 
A rather different dimension of attitude is the 
number and strength of its linkages to a related 
value system. If an attitude favoring budget 
balancing by the Federal government is tied in 
strongly with a value system of economic con- 
servatism, it will be more difficult to change 
than if it were a fairly isolated attitude of the 
person. Finally, the relation of the value system 
to the personality is a consideration of first im- 
portance. If an attitude is tied to a value system 
which is closely related to, or which consists of, 
the individual's conception of himself, then the 
appropriate change procedures become more 
complex. The centrality of an attitude refers to 
its role as part of a value system which is closely 
related to the individual's self-concept. 

An additional aspect of attitudes is not clearly 
described in most theories, namely, their re- 
lation to action or overt behavior. "Though 
behavior related to the attitude has other de- 
terminants than the attitude itself, it is also 
true that some attitudes in themselves have 
more of what Cartwright calls an action struc- 


* David Krech and Richard S$, Crutchfield, Theory 
and Problems of Social Prychology, New York, 
McGraw-Hill, 1948, pp. 160-163. 


ture than do others.19 Brewster Smith refers to 
this dimension as policy orientation! and Katz 
and Stotland speak of it as the action compo- 
nent? For example, while many people have 
attitudes of approval toward one or the other 
of the two political parties, these attitudes will 
differ in their structure with respect to relevant 
action. One man may be prepared to vote on 
election day and will know where and when he 
should vote and will go to the polls no matter 
what the weather or how great the inconve- 
nience. Another man will only vote if a party 
worker calls for him in a car. Himmelstrand's 
work is concerned with all aspects of the rela- 
tionship between attitude and behavior, but he 
deals with the action structure of the attitude 
itself by distinguishing between attitudes where 
the affect is tied to verbal expression and atti- 
tudes where the affect is tied to behavior con- 
cerned with more objective referents of the 
attitude.1? In the first case an individual derives 
satisfaction from talking about a problem; in 
the second case he derives satisfaction from tak- 
ing some form of concrete action. 

Attempts to change attitudes can be directed 
primarily at the belief component or at the 
feeling, or affective, component. Rosenberg the- 
orizes that an effective change in one component 
will result in changes in the other component 
and presents experimental evidence to confirm 
this hypothesis.14 For example, a political can- 
didate will often attempt to win people by 
making them like him and dislike his opponent, 
and thus communicate affect rather than ideas. 
If he is successful, people will not only like 
him but entertain favorable beliefs about him. 
Another candidate may deal primarily with 
ideas and hope that, if he ‘can change people's 
beliefs about an issue, their feelings will also 
change. 


10 Dorwin Cartwright, "Some Principles of Mass 
Persuasion,” Human Relations, Vol, 2, 1949, pp. 
253-267. 

11 M. Brewster Smith, "The Personal Setting of 
Public Opinions: A Study of Attitudes toward 
Russia,” Public Opinion Quarterly, Vol. 11, 1947. 
pp. 507-523. 

12 Daniel Katz and Ezra Stotland, “A Preliminary 
Statement to a Theory of Attitude Structure and 
Change,” in Sigmund Koch, editor, Psychology: A 
Study of a Science, Vol. 3, New York, McGraw-Hill, 
1959, pp. 423-475. 

1» Ulf Himmelstrand, “Verbal Attitudes and Be- 
havior,” Public Opinion Quarterly, Vol. 24, 1960. 
pp. 224-250. 

14 Milton J. Rosenberg, “A Structural Theory of 
Attitude Dynamics,” Ibid. pp. 319-340. 


THE FUNCTIONAL APPROACH TO THE STUDY OF ATTITUDES 461 


FOUR FUNCTIONS WHICH ATTITUDES 
PERFORM FOR THE INDIVIDUAL 


The major functions which attitudes perform 
for the personality can be grouped according 
to their motivational basis as follows: 


1. The instrumental, adjustive, or utilitarian 
function upon which Jeremy Bentham and the 
utilitarians constructed their model of man. A 
modern expression of this approach can be 
found in behavioristic learning theory. 

2. The ego-defensive function in which the 
person protects himself from acknowledging the 
basic truths about himself or the harsh realities 
in his external world. Freudian psychology and 
neo-Freudian thinking have been preoccupied 
with this type of motivation and its outcomes. 

3. The value-expressive function in which 
the individual derives satisfactions from express- 
ing attitudes appropriate to his personal values 
and to his concept of himself. This function is 
central to doctrines of ego psychology which 
stress the importance of self-expression, self- 
development, and self-realization. 

4. The knowledge function based upon the 
individual's need to give adequate structure to 
his universe. The search for meaning, the need 
to understand, the trend toward better organi- 
zation of perceptions and beliefs to provide 
clarity and consistency for the individual, are 
other descriptions of this function. The devel- 
opment of principles about perceptual and cog- 
nitive structure have been the contribution of 
Gestalt psychology. 


Stated simply, the functional approach is the 
attempt to understand the reasons people hold 
the attitudes they do. The reasons, however, are 
at the level of psychological motivations and 
not of the accidents of external events and cir- 
cumstances. Unless we know the psychological 
need which is met by the holding of an attitude 
we are in a poor position to predict when and 
how it will change. Moreover, the same attitude 
expressed toward a political candidate may not 
perform the same function for all the people 
who express it, And while many attitudes are 
predominantly in the service of a single type of 
motivational process, as described above, other 
attitudes may serve more than one purpose for 
the individual. A fuller discussion of how atti- 
tudes serve the above four functions is in order. 


1. The Adjustment Function 


Essentially this function is a recognition of 
the fact that people strive to maximize the 


rewards in their external environment and to 
minimize the penalties. The child develops fa- 
vorable attitudes toward the objects in his world 
which are associated with the satisfactions of his 
needs and unfavorable attitudes toward objects 
which thwart him or punish him. Attitudes ac- 
quired in the service of the adjustment function 
are either the means for reaching the desired 
goal or avoiding the undesirable one, or are 
affective associations based upon experiences in 
attaining motive satisfactions.15 The attitudes 
of the worker favoring a political party which 
will advance his economic lot are an example 
of the first type of utilitarian attitude. The 
pleasant image one has of one’s favorite food 
is an example of the second type of utilitarian 
attitude. 

In general, then, the dynamics of attitude 
formation with respect to the adjustment func- 
tion are dependent upon present or past per- 
ceptions of the utility of the attitudinal object 
for the individual. The clarity, consistency, and 
nearness of rewards and punishments, as they 
relate to the individual’s activities and goals, 
are important factors in the acquisition of such 
attitudes. Both attitudes and habits are formed 
toward specific objects, people, and symbols as 
they satisfy specific needs. The closer these ob- 
jects are to actual need satisfaction and the 
more they are clearly perceived as relevant to 
need satisfaction, the greater are the probabili- 
ties of positive attitude formation. These prin- 
ciples of attitude formation are often observed 
in the breach rather than the compliance. In 
industry, management frequently expects to 
create favorable attitudes toward job perform- 
ance through programs for making the company 
more attractive to the worker, such as provid- 
ing recreational facilities and fringe benefits. 
Such programs, however, are much more likely 
to produce favorable attitudes toward the com- 
pany as a desirable place to work than toward 
performance on the job. The company benefits 
and advantages are applied across the board to 
all employees and are not specifically relevant 
to increased effort in task performance by the 
individual worker. 

Consistency of reward and punishment also 
contributes to the clarity of the instrumental 
object for goal attainment. If a political party 
bestows recognition and favors on party workers 
in an unpredictable and inconsistent fashion, it 
will destroy the favorable evaluation of the im- 


15 Katz and Stotland, op. cit., pp. 434-443. 
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portance of working hard for the party among 
those whose motivation is of the utilitarian 
sort. But, curiously, while consistency of reward 
needs to be observed, 100 per cent consistency 
is not as effective as a pattern which is usually 
consistent but in which there are some lapses. 
When animal or human subjects are invariably 
rewarded for a correct performance, they do not 
retain their learned responses as well as when 
the reward is sometimes skipped.16 


2. The Ego-Defensive Function 

People not only seek to make the most of 
their external world and what it offers, but they 
also expend a great deal of their energy on liv- 
ing with themselves. The mechanisms by which 
the individual protects his ego from his own 
unacceptable impulses and from the knowledge 
of threatening forces from without, and the 
methods by which he reduces his anxieties cre- 
ated by such problems, are known as mecha- 
nisms of ego defense. A more complete account 
of their origin and nature will be found in 
Sarnof|" They include the devices by which 
the individual avoids facing either the inner 
reality of the kind of person he is, or the 
outer reality of the dangers the world holds 
for him. They stem basically from internal 
conflict with its resulting insecurities. In one 
sense the mechanisms of defense are adap- 
tive in temporarily removing the sharp edges 
of conflict and in saving the individual from 
complete disaster. In another sense they are 
not adaptive in that they handicap the indi- 
vidual in his social adjustments and in ob- 
taining the maximum satisfactions available to 
him from the world in which he lives, The 
worker who persistently quarrels with his boss 
and with his fellow workers, because he is act- 
ing out some of his own internal conflicts, may 
in this manner relieve himself of some of the 
emotional tensions which beset him. He is not, 
however, solving his problem of adjusting to 
his work situation and thus may deprive himself 
of advancement or even of steady employment. 

Defense mechanisms, Miller and Swanson 
point out, may be classified into two families 
on the basis of the more or less primitive nature 
of the devices employed.!* The first family, 
16 William O. Jenkins and Julian C. Stanley, “Par- 
tial Reinforcement: A Review and Critique," Psy- 
chological Bulletin, Vol. 47, 1950, pp. 193-234. 
V Irving Sarnoff, “Psychoanalytic Theory and 
Social Attitudes,” Public Opinion Quarterly, Vol. 
24, 1960, pp. 251-279. 
18 Daniel R. Miller and Guy E. Swanson, Inner 


more primitive in nature, are more socially 
handicapping and consist of denial and com- 
plete avoidance. The individual in such cases 
obliterates through withdrawal and denial the 
realities which confront him. The exaggerated 
case of such primitive mechanisms is the fan- 
tasy world of the paranoiac. The second type 
of defense is less handicapping and makes for 
distortion rather than denial. It includes ra- 
tionalization, projection, and displacement. 

Many of our attitudes have the function of 
defending our self-image. When we cannot ad- 
mit to ourselves that we have deep feelings of 
inferiority we may project those feelings onto 
some convenient minority group and bolster 
our egos by attitudes of superiority toward 
this underprivileged group. The formation of 
such defensive attitudes differs in essential 
ways from the formation of attitudes which 
serve the adjustment function. They proceed 
from within the person, and the objects and 
situation to which they are attached are merely 
convenient outlets for their expression. Not 
all targets are equally satisfactory for a given 
defense mechanism, but the point is that the 
attitude is not created by the target but by 
the individual's emotional conflicts. And when 
no convenient target exists the individual will 
create one. Utilitarian attitudes, on the other 
hand, are formed with specific reference to the 
nature of the attitudinal object. They are thus 
appropriate to the nature of the social world 
to which they are geared. The high school stu- 
dent who values high grades because he wants 
to be admitted to a good college has a utili- 
tarian attitude appropriate to the situation to 
which it is related. 

All people employ defense mechanisms, but 
they differ with respect to the extent that they 
use them and some of their attitudes may be 
more defensive in function than others. It 
follows that the techniques and conditions for 
attitude change will not be the same for ego- 
defensive as for utilitarian attitudes. 

Morcover, though people are ordinarily un- 
aware of their defense mechanisms, especially 
at the time of employing them, they differ 
with respect to the amount of insight they may 
show at some later time about their use of 
defenses. In some cases they recognize that they 
have been protecting their egos without know- 
ing the reáson why. In other cases they may 


Conflict and Defense, New York, Holt, 1960, 
pp. 194-288 
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not even be aware of the devices they have 
been using to delude themselves. 


3. The Value-Expressive Function 


While many attitudes have the function of 
preventing the individual from revealing to 
himself and others his true nature, other atti- 
tudes have the function of giving positive ex- 
pression to his central values and to the type 
of person he conceives himself to be. A man 
may consider himself to be an enlightened 
conservative or an internationalist or a liberal, 
and will hold attitudes which are the appro- 
priate indication of his central values. Thus 
we need to take account of the fact that not 
all behavior has the negative function of reduc- 
ing the tensions of biological drives or of in- 
ternal conflicts. Satisfactions also accrue to the 
person from the expression of attitudes which 
reflect his cherished beliefs and his self-image. 
The reward to the person in these instances 
is not so much a matter of gaining social recog- 
nition or monetary rewards as of establishing 
his self-identity and confirming his notion of 
the sort of person he sees himself to be. The 
gratifications obtained from value expression 
may go beyond the confirmation of self-identity. 
Just as we find satisfaction in the exercise of 
our talents and abilities, so we find reward in 
the expression of any attributes associated 
with our egos. 

Value-expressive attitudes not only give clar- 
ity to the self-image but also mold that self- 
image closer to the heart’s desire. The teenager 
who by dress and speech establishes his iden- 
tity as similar to his own peer group may ap- 
pear to the outsider a weakling and a craven 
conformer. To himself he is asserting his inde- 
pendence of the adult world to which he has 
rendered childlike subservience and conform- 
ity all his life. Very early in the development 
of the personality the need for clarity of self- 
image is important—the need to know “who 
I am.” Later it may be even more important 
to know that in some measure I am the type 
of person I want to be. Even as adults, how- 
ever, the clarity and stability of the self-image 
is of primary significance. Just as the kind, 
considerate person will cover over his acts of 
selfishness, so too will the ruthless individualist 
become confused and embarrassed by his acts 
of sympathetic compassion. One reason it is 
difficult to change the character of the adult 
is that he is not comfortable with the new 
"me." Group support for such personality 


change is almost a necessity, as in Alcoholics 
Anonymous, so that the individual is aware of 
approval of his new self by people who are 
like him. 

The socialization process during the forma- 
tive years sets the basic outlines for the indi- 
vidual's self-concept. Parents constantly hold 
up before the child the model of the good 
character they want him to be. A good boy 
eats his spinach, does not hit girls, etc. The 
candy and the stick are less in evidence in 
training the child than the constant appeal to 
his notion of his own character. It is small 
wonder, then, that children reflect the accep- 
tance of this model by inquiring about the 
characters of the actors in every drama, whether 
it be a television play, a political contest, or 
a war, wanting to know who are the "good 
guys" and who are the "bad guys." Even as 
adults we persist in labeling others in the terms 
of such character images. Joe McCarthy and 
his cause collapsed in fantastic fashion when 
the telecast of the Army hearings showed him 
in the role of the villain attacking the gentle, 
good man represented by Joseph Welch. 

A related but somewhat different process 
from childhood socialization takes place when 
individuals enter a new group or organization. 
The individual will often take over and inter- 
nalize the values of the group. What accounts, 
however, for the fact that sometimes this occurs 
and sometimes it does not? Four factors are 
probably operative, and some combination of 
them may be necessary for internalization. 
(1) The values of the new group may be highly 
consistent with existing values central to the 
personality. The girl who enters the nursing 
profession finds it congenial to consider herself 
a good nurse because of previous values of the 
importance of contributing to the welfare of 
others. (2) The new group may in its ideology 
have a clear model of what the good group 
member should be like and may persistently 
indoctrinate group members in these terms. 
One of the reasons for the code of conduct 
for members of the armed forces, devised after 
the revelations about the conduct of American 
prisoners in the Korean War, was to attempt 
to establish a model for what a good soldier 
does and does not do. (3) The activities of the 
group in moving toward its goal permit the 
individual genuine opportunity for participa- 
tion. To become ego-involved so that he can 
internalize group values, the new member must 
find one of two conditions. The group activity 
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open to him must tap his talents and abilities 
so that his chance to show what he is worth 
can be tied into the group effort. Or else the 
activities of the group must give him an active 
voice in group decisions. His particular talents 
and abilities may not be tapped but he does 
have the opportunity to enter into group deci- 
sions, and thus his need for self-determination 
is satisfied. He then identifies with the group 
in which such opportunities for ego-involve- 
ment are available. It is not necessary that 
opportunities for self-expression and self- 


determination be of great magnitude in an ` 


objective sense, so long as they are important 
for the psychological economy of the individ- 
uals themselves. (4) Finally, the individual may 
come to see himself as a group member if he 
can share in the rewards of group activity 
which includes his own efforts. The worker 
may not play much of a part in building a 
ship or make any decisions in the process of 
building it. Nevertheless, if he and his fellow 
workers are given a share in every boat they 
build and a return on the proceeds from the 
earnings of the ship, they may soon come to 
identify with the ship-building company and 
see themselves as builders of ships. 


4. The Knowledge Function 


Individuals not only acquire beliefs in the 
interest of satisfying various specific needs, they 
also seek knowledge to give meaning to what 
would otherwise be an unorganized chaotic 
universe. People need standards or frames of 
reference for understanding their world, and 
attitudes help to supply such standards. The 
problem of understanding, as John Dewey 
made clear years ago, is one "of introducing 
(1) definiteness and distinction and (2) consis- 
tency and stability of meaning into what is 
otherwise vague and wavering."!? The definite- 
ness and stability are provided in good measure 
by the norms of our culture, which give the 
otherwise perplexed individual ready-made atti- 
tudes for comprehending his universe. Walter 
Lippmann’s classical contribution to the study 
of opinions and attitudes was his description 
of stereotypes and the way they provided order 
and clarity for a bewildering set of complex- 
ities.?? The most interesting finding in Herzog's 
familiar study of the gratifications obtained by 
19 John Dewey, How We Think, New York: Mac- 
millan, 1910. 


20 Walter Lippmann, Public Opinion, New York, 
Macmillan, 1922. 


housewives in listening to daytime serials was 
the unsuspected role of information and ad- 
vice.21 The stories were liked "because they 
explained things to the inarticulate listener." 

The need to know does not of course imply 
that people are driven by a thirst for universal 
knowledge. The American public's appalling 
lack of political information has been docu- 
mented many times. In 1956, for example, only 
13 per cent of the people in Detroit could cor- 
rectly name the two United States Senators 
from the state of Michigan and only 18 per cent 
knew the name of their own Congressman.?? 
People are not avid seekers after knowledge as 
judged by what the educator or social reformer 
would desire. But they do want to understand 
the events which impinge directly on their own 
life. Moreover, many of the attitudes they have 
already acquired give them sufficient basis for 
interpreting much of what they perceive to be 
important for them. Our already existing 
stereotypes, in Lippmann's language, "are an 
ordered, more or less consistent picture of the 
world, to which our habits, our tastes, our 
capacities, our comforts and our hopes have 
adjusted themselves. They may not be a com- 
plete picture of the world, but they are a pic- 
ture of a possible world to which we are 
adapted.”28 It follows that new information 
will not modify old attitudes unless there is 
some inadequacy or incompleteness or inconsis- 
tency in the existing attitudinal structure as it 
relates to the perceptions of new situations. 

The articles by Cohen, Rosenberg, Osgood, 
and Zajonc?* discuss the process of attitude 
change with respect to inconsistencies and dis- 
crepancies in cognitive structure. 


DETERMINANTS OF ATTITUDE AROUSAL 
AND ATTITUDE CHANGE 


The problems of attitude arousal and of atti- 
tude change are separate problems. The first 
has to do with the fact that the individual has 
many predispositions to act and many influ- 
ences playing upon him. Hence we need a 


21 Herta Herzog, “What Do We Really Know about 
Daytime Serial Listeners?” in Paul F. Lazarsfeld 
and Frank N. Stanton, editors, Radio Research 
1942-1943, New York, Duell, Sloan & Pearce, 1944, 
pp. 3-33. r 
22 From a study of the impact of party organization 
on political behavior in the Detroit area, by Daniel 
Katz and Samuel Eldersveld, in manuscript. 

23 Lippmann, op. cit., p. 95. 

24 Public Opinion Quarterly, Vol. 24, 1960. 


THE FUNCTIONAL APPROACH TO THE STUDY OF ATTITUDES 465 


more precise description of the appropriate 
conditions which will evoke a given attitude. 
The second problem is that of specifying the 
factors which will help to predict the modifi- 
cation of different types of attitude. 

The most general statement that can be made 
concerning attitude arousal is that it is depen- 
dent upon the excitation of some need in the 
individual, or some relevant cue in the environ- 
ment. When a man grows hungry, he talks of 
food. Even when not hungry he may express 
favorable attitudes toward a preferred food if 
an external stimulus cues him. The ego-defen- 
sive person who hates foreigners will express 
such attitudes under conditions of increased 
anxiety or threat or when a foreigner is per- 
ceived to be getting out of place. 

The most general statement that can be 
made about the conditions conducive to atti- 
tude change is that the expression of the old 
attitude or its anticipated expression no longer 
gives satisfaction to its related need state. In 
other words, it no longer serves its function 
and the individual feels blocked or frustrated. 
Modifying an old attitude or replacing it with 
a new one is a process of learning, and learn- 
ing always starts with a problem, or being 
thwarted in coping with a situation. Being 
blocked is a necessary, but not a sufficient, con- 
dition for attitude change. Other factors must 
be operative and will vary in effectiveness de- 
pending upon the function involved. 


GENERALIZATION OF ATTITUDE CHANGE 


Perhaps the most fascinating problem in atti- 
tude change has to do with consequences to 
a person's belief systems and general behavior 
of changing a single attitude. Is the change 
confined to the single target of the attitude? 
Does it affect related beliefs and feelings? If so, 
what types of related belief and feeling are 
affected, i.e, on what does the change rub off? 
Teachers and parents, for example, are con- 
cerned when a child acquires an immoral atti- 
tude or indulges in a single dishonest act, for 
fear of the pernicious spread of undesirable 
behavior tendencies. Responsible citizens are 
concerned about the lawless actions of extrem- 
ists in the South in combating integration, 
not only because of the immediate and specific 
implications of the behavior but because of 
the general threat to legal institutions. 

Research evidence on the generalization of 
attitude change is meager. In experimental 


work, the manipulations to produce change 
are weak and last for brief periods, sometimes 
minutes and at the most several hours. It is 
not surprising, therefore, that these studies re- 
port few cases of change which has generalized 
to attitudes other than the one under attack. 
Even in the studies on self-insight by Katz 
et al, where the change in prejudice toward 
Negroes was still in evidence some two months 
after the experiment, there were no consistent 
changes in prejudice toward other minority 
groups.?5 In real-life situations outside the labo- 
ratory, more powerful forces are often brought 
to bear to modify behavior, but again the re- 
sulting changes seem more limited than one 
would expect on an a priori logical basis. Inte- 
gration of whites and Negroes in the factory 
may produce acceptance of Negroes as fellow 
workers but not as residents in one's neighbor- 
hood, or as friends in one's social group. Sig- 
nificant numbers of Democrats were influenced 
by the candidacy of Dwight Eisenhower to help 
elect him President in 1952 and 1956, but, as 
Campbell et al. have established, this change 
in voting behavior did not rub off on the rest 
of the Republican ticket.?6 Most of the Demo- 
cratic defectors at the presidential level voted 
for a Democratic Congress. Nor did they change 
their attitudes on political issues. And the 
chances are that this change will not generalize 
to other Republican presidential candidates 
who lack Eisenhower's status as a national 
figure. 

It is puzzling that attitude change seems to 
have slight generalization effects, when the 
evidence indicates considerable generalization 
in the organization of a person's beliefs and 
values. Studies of authoritarian and equali- 
tarian trends in personality do find consistent 
constellations of attitudes. It is true that the 
correlations are not always high, and Prothro 
reports that, among his Southern subjects, 
there was only a slight relationship between 
anti-Semitism and Negro prejudice?" But 
studies of the generalization hypothesis in atti- 
tude structure give positive findings. Grace 
confirmed his prediction that the attitudes 
people displayed in interpersonal relations to- 
ward their friends and colleagues carried over 


26 Stotland and Katz, op. cit. 

26 Campbell, Converse, Miller, and Stokes, op. cit. 
27 E. Terry Prothro, “Ethnocentrism and Anti- 
Negro Attitudes in the Deep South,” Journal of 
Abnormal and Social Psychology, Vol. 47, 1952, 
pp. 105-108. 
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to their attitudes toward international mat- 
ters.28 He studied four types of reaction: verbal 
hostility, direct hostility, intropunitiveness, and 
apathy. People characteristically giving one 
type of response in everyday situations would 
tend to respond similarly in professional and 
international situations. Stagner concluded on 
the basis of his empirical investigation of atti- 
tudes toward colleagues and outgroups that the 
evidence supported a generalization theory 
rather than a displacement or sublimation 
theory.2® Confirmation of the generalization 
hypothesis comes from a Norwegian study by 
Christiansen in which reactions were classified 
on two dimensions: (1) threat-oriented versus 
problem-oriented and (2) outward-directed ver- 
sus inward-directed. Thus, blaming oneself 
would be a threat-oriented, inwardly directed 
reaction, Christiansen found that (a) people 
tend to react consistently toward everyday con- 
flict situations, (b) they react consistently to 
international conflicts, (c) there is a correlation 
between reactions to everyday conflicts and to 
international conflicts, and (d) this correlation 
is lower than the correlations among reactions 
to everyday conflicts and among reactions to 
international conflicts, respectively.9° 

Three reasons can be suggested for the failure 
to find greater generalization effects in attitude 
change: 


1, The over-all organization of attitudes and 
values in the personality is highly differenti- 
ated. The many dimensions allow the individ- 
ual to absorb change without major modification 
of his attitudes. A Democrat of long standing 
could vote for Eisenhower and still remain 
Democratic in his identification because to him 
politics was not involved in this decision. 
Eisenhower stood above the political arena in 
the minds of many people. He was not blamed 
for what his party did, as the Gallup polls indi- 
cate, nor did his popularity rub off on his party. 
In 1958, in spite of Eisenhower's urgings, the 
people returned a sizable Democratic majority 
to Congress, There are many standards of judg- 


28 H. A. Grace, A Study of the Expression of Hos- 
tility in Everyday Professional and International 
Verbal Situations, New York, Columbia University 
Press, 1949. 


?9 Ross Stagner, "Studies of Aggressive Social Atti- 
tudes,” Journal of Social Psychology, Vol. 20, 1944, 
pp. 109-120. 

30 Bjorn Christiansen, Attitudes towards Foreign 
Affairs as a Function of Personality, Oslo, Norway, 
Oslo University Press, 1959. 


ment, then, which pertain to content areas of 
belief and attitude. An individual uses one 
set of standards or dimensions for a political 
decision but will shift to another set when it 
is more appropriate. 

2. The generalization of attitudes proceeds 
along lines of the individual's own psychologi- 
cal groupings more than along lines of con- 
ventional sociological categories. We may miss 
significant generalized change because we do 
not look at the individual's own pattern of 
beliefs and values. One man may dislike for- 
eigners, but to him foreigners are those people 
whose English he cannot understand; to an- 
other person foreigners are people of certain 
physical characteristics; to a third they are 
people with different customs, etc. 

People will utilize many principles in organiz- 
ing their own groupings of attitudes: (a) the 
objective similarities of the referents of the 
attitudes, (b) their own limited experiences 
with these referents, (c) their own needs, and 
(d) their own ideas of causation and of the na- 
ture of proper relationships. Peak has used the 
concept of psychological distance and difference 
between events in psychological space to de- 
scribe attitude structure and generalization.9! 

The liberal-conservative dimension, for exam- 
ple, may be useful for characterizing large 
groups of people, but individuals may differ 
considerably in their own scaling of attitudes 
comprising liberalism-conservatism. Some con- 
servatives can stand to the left of center on 
issues of the legal rights of the individual or 
on internationalism. Social classes show differ- 
ences in liberal and conservative ideology, the 
lower socio-economic groups being more lib- 
eral on economic and political issues and the 
upper income groups more liberal on tolerance 
for deviants and on democratic values in inter- 
personal relationships. Stouffer found that dur- 
ing the McCarthy period the low-status groups 
were more intolerant, and other studies have 
shown more authoritarian values among these 
groups.2 

8. Generalization of attitude change is lim- 
ited by the lack of systematic forces in the 
social environment to implement that change. 


31 Helen Peak, “Psychological Structure and Per- 
son Perception," in Renato Tagiuri and Luigi 
Petrullo, editors, Person Perception and Interper- 
sonal Behavior, Stanford, Calif, Stanford Univer- 
sity Press, 1958, pp. 337-352. 

32 Samuel A. Stouffer, Communism, Conformity 
and Civil Liberties, New York, Doubleday, 1955. 
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Even when people are prepared to modify 
their behavior to a considerable extent they 
find themselves in situations which exert pres- 
sures to maintain old attitudes and habits. 
The discharged convict who is ready to change 
his ways may find it difficult to find a decent 
job and his only friends may be his former 
criminal associates. It does not necessarily help 
an industrial firm to train its foremen in hu- 
man relations if the foremen must perform in 
an authoritarian structure. 


ASSESSMENT OF MOTIVATIONAL BASES OF 
ATTITUDES 


If an understanding of the nature of atti- 
tudes and the conditions for their change de- 
pends upon a knowledge of their functional 
bases, then it becomes of first importance to 
identify the underlying motivational patterns. 
The traditional advertising approach is to give 
less attention to the research assessment of 
needs and motives and more attention to mul- 
tiple appeals, to gaining public attention, and 
to plugging what seems to work. Multiple ap- 
peals will, it is hoped, reach some members 
of the public with an effective message. In 
political campaigns, there is more concern with 
gearing the approach to the appropriate au- 
dience. If the political party makes serious 
mistakes in its assessment of the needs of par- 
ticular groups, it is not a matter of losing a 
few potential customers but of losing votes to 
the opposing party, and so losing the election. 
Political leaders are, therefore, making more 
and more use of public opinion polls and a 
number of the major candidates for high office 
enlist their own research specialists. So true is 
this that we may no longer have political con- 
ventions naming a dark-horse candidate for 
the presidency. If the leaders are not convinced 
by poll results that a candidate has a good 
chance to win, they are not likely to support 
him. 

There are no reliable short-cuts to the assess- 
ments of the needs which various attitudes sat- 
isfy. Systematic sampling of the population in 
question by means of interviews or of behav- 
ioral observation is a necessity. A growing 
number of devices are becoming available to 
supplement the depth interview. Objective 
scales for determining personality trends, such 
as the F-scale or the Minnesota Multiphasic 
Inventory, have been widely used. Projective 
methods which call for the completion of sen- 


tences and stories or furnishing stories about 
ambiguous pictures are just beginning to be 
exploited. In a nationwide survey of attitudes 
toward public health, Veroff et al. successfully 
used a picture test to obtain scores for people 
with respect to their needs for achievement, 
for affiliation, and for power? Methods for 
measuring motivation are difficult, but the ba- 
sic logic in their application is essentially that 
of any research tool. If early abuses of these 
instruments do not prejudice the research field, 
they will in the future have almost as wide a 
use as the polls themselves. Moreover, polling 
methods can be adapted to measuring people's 
needs with indirect questions which have been 
validated against more projective tests. 

In many situations inferences can be made 
about people’s needs without elaborate mea- 
sures. If farm income has fallen drastically in 
a given section of the country, or if unemploy- 
ment has risen sharply in a certain city, obvious 
inferences can be drawn. The extent and depth 
of the dissatisfaction will be better known 
through adequate measurement. 

Measures of the four types of motivational 
patterns discussed indicate wide individual dif- 
ferences in the extent to which the patterns 
characterize the person. Though all people 
employ defense mechanisms, there are wide 
differences in the depth and extent of defen- 
siveness. And Cohen has shown that the need 
for knowledge varies even in a college popu- 
lation.34 

In spite of characteristic differences in the 
strength of needs and motives, we cannot pre- 
dict attitude change with precision solely on 
the basis of measures of need. We must also 
have measures of the related attitudes. Knowl- 
edge of the need state indicates the type of 
goal toward which the individual is striving. 
But the means for reaching this goal may vary 
considerably, and for this reason we need to 
know the attitudes which reflect the evaluation 
of the various means. Farmers with depressed 
incomes may still vote for the Republican 
Party if they have confidence in Nixon's farm 
program. Some need patterns furnish more di- 


33 Joseph Veroff, John W. Atkinson, Sheila C. Feld, 
and Gerald Gurin, “The Use of Thematic Apper- 
ception to Assess Motivation in a Nationwide In- 
terview Study," Psychological Monographs, in press. 
34 Arthur R. Cohen, “Need for Cognition and 
Order of Communication as Determinants of Opin- 
ion Change," in Carl Hovland et al., editors, The 
Order of Presentation in Persuasion, New Haven, 
Conn., Yale University Press, 1957, pp. 79-97. 
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rect predictions than others. The defensive 
person who is extrapunitive will be high in 
prejudice toward outgroups. Even in this case, 
however, his prejudices toward specific out- 
groups may vary considerably. 


SUMMARY 


The purpose of this paper was to provide a 
psychological framework for the systematic 
consideration of the dynamics of public and 


private attitudes. Four functions which atti- 
tudes perform for the personality were iden- 
tified: the adjustive function of satisfying 
utilitarian needs, the ego-defensive function of 
handling internal conflicts, the value-expressive 
function of maintaining selfidentity and of 
enhancing the self-image, and the knowledge 
function of giving understanding and meaning 
to the ambiguities of the world about us. The 
role of these functions in attitude formation 
was described. 


50. Compliance, Identification, and Internalization: Three 
Processes of Attitude Change 


HERBERT C. KELMAN 


A crucial issue in communication research re- 
lates to the nature of changes (if any) that are 
brought about by a particular communication 
or type of communication. It is not enough to 
know that there has been some measurable 
change in attitude; usually we would also want 
to know what kind of change it is. Is it a super- 
ficial change, on a verbal level, which disap- 
pears after a short lapse of time? Or is it a 
more lasting change in attitude and belief, 
which manifests itself in a wide range of situ- 
ations and which is integrated into the person's 
value system? Or, to put it in other terms, did 
the communication produce public conformity 
without private acceptance, or did it produce 
public conformity coupled. with private accep- 
tance? (Cf. 1, 4) Only if we know something 
about the nature and depth of changes can 
we make meaningful predictions about the way 
in which attitude changes will be reflected in 
subsequent actions and reactions to events. 
These questions about the nature of attitude 
changes are highly significant in the study of 
international attitudes. For example, we may 
have observed changes in opinion toward cer- 
tain international issues—e.g. aspects of for- 
eign policy, international organization, or 
disarmament—among the population of a given 
country. The implications that we draw from 
these changes will depend on their depth and 


* Reprinted from Journal of Conflict Resolution, 
1958, 2, 51-60, with permission of the author and 
the publisher. The experiment reported here was 
conducted while the author was at Johns Hopkins 
University as a Public Health Service Research 
Fellow of the National Institute of Mental Health. 
Additional financial support was received from the 
Yale Communication Research Program, which is 
under the direction of Carl I. Hovland and which 
is operating under a grant from the Rockefeller 
Foundation. 


on the psychological meanings that can be as- 
signed to them. Let us assume that we find an 
increase in favorable attitudes toward the 
United Nations among the population of the 
United States at a particular juncture. This 
change in attitude may be due primarily to 
recent pronouncements by high-placed figures 
and may thus represent an aspect of “social 
conformity.” On the other hand, the change 
may result from a series of international events 
which have led large segments of the popu- 
lation to re-evaluate American foreign policy 
and to ascribe a more central role to the UN. 
Depending on which of these motivational 
processes underlies the change in attitude, we 
would make different predictions about the 
manifestations and consequences of the new 
attitudes: about their durability, about the 
number of different attitudinal areas that will 
be affected by them, and about the ways in 
which they will be translated into action and 
will determine reactions to international events. 
Similarly, our predictions about the subsequent 
history of the new attitudes will depend on 
their cognitive links, i.e., the particular attitude 
structure within which the new attitude to- 
ward the UN is imbedded. For example, 
Americans may have become more favorable 
toward the UN because an important resolution 
sponsored by the United States delegate has 
been accepted. The new attitude toward the 
UN is thus an aspect of attitudes toward one's 
own nation and its prestige and international 
success. On the other hand, favorableness to- 
ward the UN may have increased because UN 
action has successfully averted war in a very 
tense conflict situation. In this case, the new 
attitude toward the UN is imbedded in an 
attitude structure revolving around the whole 
question of war and effective means of pre- 
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venting its outbreak. Again, we would draw 
different implications from the changed atti- 
tudes, depending on which of these attitude 
areas was primarily involved in the occurrence 
of change. 

The same considerations apply when we 
interpret the effects of international communi- 
cations. For example, if we find changes in the 
way in which nationals of different countries 
perceive one another, it would be important 
to know at what level these changes have oc- 
curred and to what motivational and cognitive 
systems they are linked. These questions are 
important not only for the analysis of changes 
in attitude toward various international issues, 
objects, or events which may have occurred as 
a result of various kinds of communication or 
experience but also for the development of 
propositions about the conditions for change. 
In international relations, as in other areas of 
social behavior, one of our ultimate concerns 
is the exploration of the conditions under 
which lasting changes occur, changes which are 
generalized to many situations and which rep- 
resent some degree of value reorganization. 

In the present paper I should like to describe 
briefly an experimental study which is con- 
cerned with some of the conditions that deter- 
mine the nature of attitude changes produced 
by communications on social issues, The spe- 
cific content of the attitudes that were inves- 
tigated in this study was in the area of race 
relations rather than international relations. 
The hypotheses refer, however, to general pro- 
cesses of attitude change, irrespective of the 
specific attitudinal area. Relationships found 
should be equally applicable, therefore, to the 
analysis of international attitudes. 


THEORETICAL FRAMEWORK 


The experiment reported here grows out of 
a broader theoretical framework concerned 
with the analysis of different processes of atti- 
tude change resulting from social influence. It 
is impossible to present this framework in de- 
tail in the present paper, but I should like to 
outline its main features. 

The starting point of the theoretical analysis 
is the observation discussed in the preceding 
paragraphs, i.e, that changes in attitudes and 
actions produced by social influence may occur 


1 A detailed description of the theoretical frame- 


work and of the experiment reported here will 
published dienen 0 yero 


at different "levels." It is proposed that these 
differences in the nature or level of changes 
that take place correspond to differences in 
the process whereby the individual accepts in- 
fluence (or "conforms"). In other words, the 
underlying processes in which an individual 
engages when he adopts induced behavior may 
be different, even though the resulting overt 
behavior may appear the same. 

"Three different processes of influence can be 
distinguished: compliance, identification, and 
internalization.? 

Compliance can be said to occur when an 
individual accepts influence because he hopes 
to achieve a favorable reaction from another 
person or group. He adopts the induced be- 
havior not because he believes in its content 
but because he expects to gain specific rewards 
or approval and avoid specific punishments or 
disapproval by conforming. Thus the satisfac- 
tion derived from compliance is due to the 
social effect of accepting influence. 

Identification can be said to occur when an 
individual accepts influence because he wants 
to establish or maintain a satisfying self-defin- 
ing relationship to another person or a group. 
This relationship may take the form of classi- 
cal identification, in which the individual takes 
over the role of the other, or it may take the 
form of a reciprocal role relationship. The indi- 
vidual actually believes in the responses which 
he adopts through identification, but their spe- 
cific content is more or less irrelevant. He adopts 
the induced behavior because it is associated 
with the desired relationship. Thus the satisfac- 
tion derived from identification is due to the 
act of conforming as such. 

Internalization can be said to occur when 
an individual accepts influence because the 
content of the induced behavior—the ideas and 
actions of which it is composed—is intrinsically 
rewarding. He adopts the induced behavior 
because it is congruent with his value system. 
He may consider it useful for the solution of 
a problem or find it congenial to his needs. 
Behavior adopted in this fashion tends to be 
integrated with the individual's existing values. 
Thus the satisfaction derived from internaliza- 
tion is due to the content of the new behavior. 

The three processes represent three quali- 
tatively different ways of accepting influence. 
A systematic treatment of the processes might, 


? A similar distinction, between four processes of 
conformity, was recently presented by Marie Jahoda 
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therefore, begin with an analysis of the deter- 
minants of influence in general. These deter- 
minants can be summarized by the following 
proposition: The probability of accepting in- 
fluence is a combined function of (a) the rela- 
tive importance of the anticipated effect, (b) 
the relative power of the influencing agent, 
and (c) the prepotency of the induced response. 
A variety of experimental findings can be cited 
in support of this proposition. 

Compliance, identification, and internaliza- 
tion can each be represented as a function of 
these three determinants. For each process, 
however, these determinants take a qualita- 
tively different form. Thus the determinants 
of the three processes can be distinguished from 
one another in terms of the nature of the 
anticipated effect, the source of the influencing 
agent's power, and the manner in which the 
induced response has become prepotent. 

In other words, each process is characterized 
by a distinctive set of antecedent conditions, 
involving a particular qualitative variation of 
a more general set of determinants. Given the 
proper set of antecedents, then, influence will 
take the form of compliance, identification, or 
internalization, respectively. Each of these cor- 
responds to a characteristic pattern of internal 
responses (thoughts and feelings) in which the 
individual engages while adopting the induced 
behavior. 

Similarly, each process is characterized by a 
distinctive set of consequent conditions, involv- 
ing a particular qualitative variation in the 
subsequent history of the induced response. 
Responses adopted through different processes 
will be performed under different conditions, 
will be changed and extinguished under dif- 
ferent conditions, and will have different 
properties. 

Since each of the three processes mediates 
between a distinct set of antecedents and a dis- 
tinct set of consequents, the proposed distinc 
tions between the three processes can be tested 
by experiments which attempt to relate the 
antecedents postulated for a given process to 
the consequents postulated for that process. 
The present experiment was designed to vary 
one of the antecedents—the source of the in- 
fluencing agents power—and to observe the 
effects of this variation on one of the conse- 
quents—the conditions of performance of the 
induced response. 

Power is defined as the extent to which the 
influencing agent is perceived as instrumental 


to the achievement of the subject's goals. The 
sources of the agent’s power may vary (cf. 2). 
The following hypotheses are offered regarding 
the variations in source of power: 


1. To the extent to which the power of the 
influencing agent is based on means-control, 
conformity will tend to take the form of com- 
pliance. 

2. To the extent to which the power of the 
influencing agent is based on attractiveness, 
conformity will tend to take the form of identi- 
fication. 

3. To the extent to which the power of the 
influencing agent is based on credibility, con- 
formity will tend to take the form of inter- 
nalization. 


Now let us look at the consequent side. One 
of the ways in which behaviors adopted through 
different processes can be distinguished is in 
terms of the conditions under which the be- 
havior is performed. The following hypotheses 
are offered regarding the conditions of per- 
formance: 


1. When an individual adopts an induced 
response through compliance, he tends to per- 
form it only under conditions of surveillance 
by the influencing agent. 

2. When an individual adopts an induced 
response through identification, he tends to 
perform it only under conditions of salience 
of his relationship to the agent. 

3. When an individual adopts an induced 
response through internalization, he tends to 
perform it under conditions of relevance of the 
issue, regardless of surveillance or salience. 


PROCEDURE 


The subjects in this experiment were Negro 
college Freshmen in a border state. The ex- 
periment was conducted in the spring of 1954, 
just prior to the announcement of the Supreme 
Court decision on desegregation in the public 
schools. The social influence situation to which 
the students were exposed consisted of a fixed 
communication designed to change their atti- 
tudes on an issue related to the impending 
Court decision. Specifically, each of the com- 
munications employed in the study presented 
essentially the following message: If the Su- 
preme Court rules that segregation is uncon- 
stitutional, it would still be desirable to 
maintain some of the private Negro colleges 
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as all-Negro institutions, in order to preserve 
Negro culture, history, and tradition. Prelimi- 
nary testing indicated that a large majority of 
the subjects would initially oppose the message 
presented in the communication. 

The communications were tape-recorded in- 
terviews between a moderator and a guest (the 
communicator). They were presented to the 
subjects as recordings of radio programs which 
we were interested in evaluating. By varying 
the nature of these communications, it was 
possible to manipulate experimentally the 
source and degree of the communicator's 
power, while keeping the message of the com- 
munication constant. Four different communi- 
cations were used, as can be seen from Table 1, 
which outlines the basic design of the experi- 
ment (see lef-hand column). 

In one communication the attempt was made 
to present the communicator in such a way 
that he would be perceived as possessing high 
means-control. He was introduced as the presi- 
dent of the National Foundation for Negro 
Colleges. In the course of the interview it be- 
came evident that his foundation had been 
supporting the college in which the study was 
being conducted; that he had almost complete 
control over the funds expended by the foun- 
dation; and that he was the kind of person 
who would not hesitate to use his control in 
order to achieve conformity. He made it clear 
that he would withdraw foundation grants 
from any college in which the students took a 
position on the issue in question which was at 
variance with his own position. 

In the second communication the communi- 


cator was presented in such a way that he would 
be perceived as possessing high attractiveness. 
He was introduced as a Senior and president 
of the student council in a leading Negro uni- 
versity. He was also chairman of his university's 
chapter of an organization called Student Poll, 
which recently did a study on the attitudes of 
Negro college Seniors on issues relating to the 
Supreme Court decision. He presented the 
same message as the first communicator, but 
he made it clear that he was presenting not 
simply his own opinions but the overwhelming 
consensus of opinion of the college students 
represented in the polls. He was portrayed as 
a representative of one of the subjects' refer- 
ence groups and as a person who was in a 
position to supply valid information on the 
group norms. 

In the third communication the communi- 
cator was presented in such a way that he 
would be perceived as possessing high credi- 
bility. He was introduced as a professor of 
history in one of the country's leading univer- 
sities. In the course of the interview, it became 
evident that he was one of the top experts on 
the history and problems of minority groups; 
that he was highly respected both by his col- 
leagues and by members of minority groups; 
and that he had a profound concern for the 
welfare of the American Negro community. He 
presented the same position as the other speak- 
ers, but he made it clear that this position was 
based on his research and on the evidence of 
history. 

For purposes of comparison, a fourth com- 
munication was used in which the communi- 


TABLE 1. Design of the Experiment and Predictions* 


Questionnaires: Variations in Conditions 


Experimental groups: 


Variations in Surveillance 
Communicator Salience 
Power Issue-Relevance 

High power, based H 

on means-control 
High power, based H 

on attractiveness 
High power, based H 

on credibility 
Low-power L 


of Performance 


Questionnaire I Questionnaire II Questionnaire IH 


Nonsurveillance Non-surveillance 


Salience Non-salience 
Issue-Relevance — Issue-Relevance 
L L 
H L 
H H 
E L 


1425 duc ————— ——— 2 
* H — high probability that attitude will be expressed; L — low probability 


that attitude will be expressed. 
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cator was presented. in such a way that he 
would be perceived as possessing low power, 
that is, as being low in means-control, attrac- 
tiveness, and credibility. He was introduced as 
an "ordinary citizen," who had no control over 
the subjects and no special knowledge about 
the topic. His attractiveness and credibility 
were further undermined by the fact that he 
was portrayed as a white man with a southern 
accent, who had recently come from Missis- 
sippi and who did not clearly dissociate him- 
self from a pro-segregation point of view. 

Each of the four communications was pre- 
sented to a different experimental group. There 
was also a control group, which was not ex- 
posed to any communication at all. This group 
provided a base line for postcommunication 
attitudes, since we did not obtain before-scores 
from the experimental subjects. 

After exposure to the communication, the 
subjects in each experimental group filled out 
attitude questionnaires, designed to measure 
the extent of their agreement with the com- 
municator. The questionnaire consisted of 
eighteen statements which were directly or in- 
directly related to the central message of the 
communication. Subjects were asked to respond 
to each item on a six-point scale. Scores for 
each item ranged from 1 for extreme disagree- 
ment with the communicator's position to 7 
for extreme agreement (the score of 4 was not 
used, since the scale did not contain a neutral 
position). A total score for each subject was 
computed by adding up the individual scores 
on the 18 items. Thus the range of possible 
total scores was from 18 (complete disagree- 
ment) to 126 (complete agreement). 

To vary the conditions of performance, we 
presented these eighteen statements to each 
subject on three separate questionnaires. As 
can be seen from Table 1, it is assumed that 
issue-relevance was present on all three ques- 
tionnaires, since the attitude items dealt with 
the issues involved in the communication. The 
questionnaires differed, however, with respect 
to surveillance and salience. 

Questionnaire I was completed under con- 
ditions of salience and surveillance. In order 
to maximize the salience of the communicator 
and the communication, the questionnaire was 
administered immediately after the communi- 
cation. In order to maximize surveillance by 
the communicator, it was made clear—both by 
the appearance of the questionnaire and by 
the instructions—that this questionnaire was 


being administered at the communicator's re- 
quest and that he would see each subject's 
answers. Moreover, the subjects were asked to 
sign their names and to give other identifying 
information. 

Questionnaire II was completed under con- 
ditions of salience and non-surveillance. In 
order to maximize salience, the questionnaire 
was administered immediately after Question- 
naire I, so that the relationship between the 
questionnaire and the communication was still 
clearly apparent. In order to minimize sur- 
veillance, Questionnaire II was anonymous, 
and it was made clear to the subjects that 
their responses would not be seen by the com- 
municator or by anyone other than the research 
staff. 

Questionnaire III was completed under con- 
ditions of non-salience and non-surveillance. 
In order to minimize the salience of the com- 
municator and the communication, the ques- 
tionnaire was administered from one to two 
weeks after the communication session, in a 
different place, under different auspices, and 
by a different experimenter. The questions 
were imbedded in a large number of other 
attitudinal items; and no mention of the com- 
munication was made. In order to minimize 
surveillance, Questionnaire III was anonymous 
and was in no way tied in to the communi- 
cation. 


HYPOTHESES 


The specific hypotheses that were tested were 
as follows: 


1. Attitudes adopted from a communicator 
whose power is based on means-control will 
tend to be expressed only under conditions of 
surveillance by the communicator. The mediat- 
ing process postulated here is compliance. 

The specific prediction can be noted in Ta- 
ble 1. In the means-control group there will be 
a greater amount of acceptance of the com- 
municator's position on Questionnaire I than 
on Questionnaires II and III, since Question- 
naire I is the only one completed under con- 
ditions of surveillance. There should be no 
significant difference between Questionnaires II 
and III. 

2. Attitudes adopted from a communicator 
whose power is based on attractiveness will 
tend to be expressed only under conditions of 
salience of the subject's relationship to the 
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communicator. The mediating process postu- 
lated here is identification. 

Specifically, it is predicted that in the attrac- 
tiveness group there will be a smaller amount 
of acceptance of the communicator's position 
on Questionnaire III than on Questionnaires 
I and II, since Questionnaire III is the only 
one completed under conditions of non- 
salience. There shoud be no significant differ- 
ence between Questionnaires I and II. 

3. Attitudes adopted from a communicator 
whose power is based on credibility will tend 
to be expressed under conditions of relevance 
of the issue, regardless of surveillance or sa- 
lience. The mediating process postulated here 
is internalization. 


'The specific prediction for the credibility 
group is that there will be no significant dif- 
ferences between the three questionnaires, 
since they were all completed under conditions 
of issue-relevance. 


RESULTS 


Before proceeding to examine the data which 
bear directly on the hypotheses, it was neces- 
sary to check on the success of the experimental 


variations. Did the subjects really perceive each 
of the variations in communicator power in 
the way in which we intended it? To provide 
an answer to this question, Questionnaire II 
included a series of statements about the 
speaker and the communication to which the 
subjects were asked to react. An analysis of 
these data indicated that, by and large, the 
experimental manipulations succeeded in pro- 
ducing the conditions they were intended to 
produce, thus making possible an adequate 
test of the hypotheses. 

The findings which are directly relevant to 
the hypotheses are summarized in Tables 2 and 
3. Table 2 presents the mean attitude scores 
for the four experimental groups on each of 
the three questionnaires. All subjects who had 
completed the three questionnaires were used 
in this analysis. 

It can be seen from the summary of the 
significance tests that all the experimental pre- 
dictions were confirmed. In the means-control 
group, the mean score on Questionnaire I is 
significantly higher than the mean scores on 
Questionnaires II and III; and there is no sig- 
nificant difference between the scores on Ques- 
tionnaires II and III. In the attractiveness 


TABLE 2. Effects of Variations in Communicator Power on Acceptance of 
Induced. Attitudes under Three Conditions of Measurement 


Mean Attitude Scores 


Question- Question- Question- 
Groups N naire I naire II naire III 
Means-control 55 63.98 60.65 58.04 
(compliance) 
Attractiveness 48 56.81 55.94 49.67 
(identification) 
Credibility 51 59.51 56.39 56.10 
(internalization) 
Low power 48 49.33 50.58 53.85 
Summary of Significance Tests 
Sources of Variation F p 
(1) Between questionnaires 3.6 <0.05 
Means-control (2) I versus I1 and II 5.8 <0.05 
(3) II versus IHI 1.4 n.s. 
(1) Between questionnaires 7.2 <0.01 
Attractiveness (2) Land II versus III 14.2 «0.01 
(3) I versus II 0.2 ns. 
Credibility Between questionnaires 2.3 ns. 
Low power Between questionnaires 2.0 ns. 
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group, the mean score on Questionnaire III 
is significantly lower than the mean scores on 
Questionnaires I and II; and there is no sig- 
nificant difference between the scores on Ques- 
tionnaires I and II. In the credibility group, 
there are no significant differences beween the 
three questionnaires. 

While these results are all in line with the 
hypotheses, examination of the means in Table 
2 reveals that the findings are not so clear-cut 
as they might be. Specifically, we should expect 
a relatively large drop in mean score for the 
means-control group from Questionnaire I to 
Questionnaire II. In actual fact, however, the 
drop is only slightly higher than that for the 
credibility group. This might be due to the 
fact that the analysis is based on all subjects, 
including those who were not influenced by 
the communication at all. The hypotheses, 
however, refer only to changes from question- 
naire to questionnaire for those people who 
were initially influenced. 

It was not possible to identify the subjects 
who were initially influenced, since there were 
no before-scores available for the experimental 
groups. It was possible, however, to approxi- 
mate these conditions by using only those sub- 


jects who had a score of 60 or above on 
Questionnaire I. If we make certain limited 
assumptions (which I cannot spell out in this 
brief report), it can be shown that the use of 
a cutoff point of 60 “purifies” the experimental 
groups to some degree. That is, the subsamples 
selected by this criterion should have a higher 
ratio of influenced to uninfluenced subjects 
than the total groups from which they were 
selected. It was anticipated that an analysis 
based on these subsamples would provide a 
better test of the hypotheses and would yield 
more clear-cut results. This did, in fact, hap- 
pen, as can be seen from Table 3. 

Table 3 presents the mean attitude scores 
for the three high-power groups, using only 
those subjects who had scores of 60 or above 
on Questionnaire I. Examination of the means 
reveals a pattern completely consistent with 
the hypotheses. In the means-control group, 
agreement with the communicator is relatively 
high on Questionnaire I and declines on 
Questionnaires II and III. In the attractive- 
ness group, agreement is high on Question- 
naires I and II and declines on Questionnaire 
III. In the credibility group, changes from 
questionnaire to questionnaire are minimal. 


TABLE 3. Effects of Variations in Communicator Power on Acceptance of 
Induced Attitudes under Three Conditions of Measurement* 


Mean Attitude Scores 


Question- Question- Question- 
Groups N naires I naires IT naires III 
a Ee 
Means-control 30 78.20 70.76 67.56 
(compliance) 
Attractiveness 23 71.80 69.57 59.70 
identification) 
crei 26 73.85 71.04 69.27 
(internalization) 
Summary of Significance Tests 
Sources of Variation F p 
(1) Between questionnaires 5.2 <0.01 
Means-control (2) I versus II and III 9.4 «0.01 
(8) II versus III 0.9 n.s 
(1) Between questionnaires 14.5 <0.01 
Attractiveness (2) I and II versus III 28.4 <0.01 
(3) I versus II 0.6 n.s 
Credibility Between questionnaires 1.1 ns. 


* Data based on a selected sample, 
enced subjects. Criterion for selectio 


naire I. 


containing a higher proportion of influ- 
n was a score of 60 or above on Question- 
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Analyses of variance clearly confirmed all the , motivational bases. Let us take, for example, 


experimental predictions. 


CONCLUSIONS 


It would be premature to accept the hy- 
potheses tested in this experiment as general 
principles that have been proved. The experi- 
ment does, however, lend considerable support 
to them. To the extent to which the hypothe- 
ses were substantiated, the experiment also 
gives support to the theoretical framework 
from which these hypotheses were derived. The 
mediating concepts of compliance, identifica- 
tion, and internalization seem to provide a 
unified and meaningful way of organizing the 
present experimental findings and of relating 
them to a more general conceptual framework. 

The framework presented here can be ap- 
plied directly to the analysis of the effects of 
various communications and other forms of 
social influence on attitudes and actions in the 
international sphere. In the study of public 
opinion, for example, it should help us iden- 
tify some of the conditions which are likely 
to produce one or another of these processes 
` and predict the subsequent histories and action 
implications of attitudes adopted under these 
sets of conditions. This framework may also 
be helpful in the study of the social influences 
which affect decision-making proceses and 
negotiations on the part of various elites. 

Some of the concepts presented here might be 
useful not only for the study of change but also 
for the analysis of existing attitudes and their 
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51. Attitude and the Prediction of Behavior 


MARTIN FISHBEIN 


Throughout this book, relatively little has been 
said about the relations between attitude and 
behavior. To a large extent, this omission re- 
flects the present state of the field. After more 
than seventy-five years of attitude research, there 
is still little, if any, consistent evidence sup- 
porting the hypothesis that knowledge of an 
individual's attitude toward some object will 
allow one to predict the way he will behave 
with respect to the object. Indeed, what little 
evidence there is to support any relationship 
between attitude and behavior comes from stud- 
ies showing that a person tends to bring his 
attitude into line with his behavior rather than 
from studies demonstrating that behavior is a 
function of attitude (e.g., Cohen, 1960; Gerard, 
1965; Landy, 1966). 

It is my contention that we psychologists 
have been rather naive in our attempts to un- 
derstand and to investigate the relationships 
between attitude and behavior. More often than 
not, we have attempted to predict some behav- 
ior from some measure of attitude and found 
little or no relationship between these variables. 
Yet, rather than questioning our basic assump- 
tion that there is a strong relationship between 
attitude and behavior, we have tended to blame 
our failures on our measuring instruments, on 
our definition of attitude, or on both. Thus, 
from its relatively simple beginning as à unidi- 
mensional concept that referred to the amount 
of affect for or against some psychological ob- 
ject, the concept of attitude has grown into a 
complex, multidimensional concept consisting 
of affective, cognitive, and conative components. 

While this view has done a great deal to 
stimulate new types of research and, more spe- 


* Revised and expanded for this book from "The 
Relationships between Beliefs, Attitudes, and Be- 
havior," in S. Feldman (Ed.), Cognitive Consistency: 
Motivational Antecedents and Behavioral Conse- 
quents, Academic Press, New York, 1966, with per- 
mission of the publisher. 


cifically, to improve our understanding of the 
relationship between belief and attitude (i.e. 
between cognition and affect), and while it has 
recently led to investigations of the relation- 
ships between attitude and behavioral inten- 
tions (i.e. between affect and conation), it has 
not, to the best of my knowledge, contributed 
very much to our understanding of the specific 
relationships between attitude and behavior. 


THE CONCEPT OF ATTITUDE 


Before turning to a consideration of these re- 
lationships, however, let us first briefly review 
the concept of attitude. Perhaps the best way 
to begin is to provide a brief (and admittedly 
selective) historical sketch. In 1935, Gordon All- 
port reviewed the general area of attitude the- 
ory and research. After considering more than 
one hundred different definitions of attitude, 
Allport concluded that most investigators ba- 
sically agreed that an attitude is a learned pre- 
disposition to respond to an object or class of 
objects in a consistently favorable or unfavor- 
able way. Furthermore, he pointed out that this 
bipolarity in the direction of an attitude (i.e., 
the favorable versus the unfavorable) is often 
regarded as the most distinctive feature of the 
concept. Thus attitude was conceptualized as a 
simple unidimensional concept. 

However, as Allport noted, research based on 
this conception of attitude had not resulted in 
behavioral prediction, and it appeared to him 
that this unidimensional view was oversimpli- 
fied. Indeed, from Allport's point of view, two 
people could be equally favorable toward an 
object, yet feel differently about components or 
characteristics of the object. For example, two 
people could feel equally favorable toward the 
church but feel quite differently about charac- 
teristics or practices of the church. Similarly, 
two people could be equally in favor of change 
but disagree on the modus operandi of a reform 
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movement. Thus, according to Allport, although 
two people may have the same degree of affect 
toward an object, they may differ qualitatively 
in their attitude toward it. Obviously, one rea- 
son why we cannot predict behavior from atti- 
tude, he continued, is that our measures of 
attitude are unidimensional and do not take 
this qualitative nature of attitude into account. 

Allport argued, therefore, for the consider- 
ation of the qualitative nature of attitude. 
Allport's plea was not accompanied by a tech- 
nique for measuring these qualitative differ- 
ences, however, and this, in part, may explain 
why the plea fell on deaf ears. Indeed, following 
Allport's review, investigators still continued to 
measure attitude by obtaining single scores that 
essentially placed the respondent somewhere 
along a single dimension of favorableness or un- 
favorableness toward the attitude object. In 
fact, it is worth noting that two of the major 
attitude measuring instruments that have been 
introduced since Allport’s review have been de- 
signed specifically to obtain unidimensional 
scores (i.e., the Guttman Scale and the Semantic 
Differential). Nevertheless, the prediction of be- 
havior remained a major concern, and an un- 
solved problem. 

I think the second major step came when 
Leonard Doob (1947) suggested that there may 
not be any one-to-one relationship between 
attitude and behavior. Doob's argument was 
quite straightforward, and it seems surprising 
that so little attention has been paid to it. 

Basically, Doob's argument was as follows: 
Attitude is a learned predisposition to respond; 
that is, it is a learned mediating response (r,). 
Thus one has to learn the attitude—the appro- 
priate predisposition toward any given object. 
But once one has learned the attitude, he also 
must learn what response to make to it—that 
is, there is no innate relationship between the 
attitude and behavior; one still has to learn a 
behavioral response. Two people may learn to 
hold the same attitude toward a given stimulus; 
clearly, however, they may also learn to make 
different responses given the same learned atti- 
tude, 

For example, two students may learn to feel 
equally favorable toward a given instructor. 
Furthermore, this feeling may initially elicit 
the same overt response in both students (eg. 
calling the instructor by his first name). From 
Doob's point of view, the probability that this 
behavior will persist is a function of the re- 
inforcement the students get for making this 


response. For example, with respect to one stu- 
dent, the instructor might say, "Well, I'm glad 
you finally decided to drop that ‘professor’ non- 
sense,” while he might tell the other student, 
“I'd prefer it if you wouldn't call me by my 
first name." If this were the case, Doob's theory 
would predict that the behavior would continue 
for the first student, but not for the second. 

Obviously, there may be some problems with 
this approach. More specifically, it is possible 
that the students' attitudes toward the instruc- 
tor may also change differentially over time as 
a function of this reinforcement. However, even 
if this is the case, the main point is that Doob 
provided a reasonable partial answer, or at least 
an approach to an answer, to the problem of 
the relation. between attitude and behavior. 
Doob was essentially arguing that attitude and 
behavior could be unrelated; that it was per- 
fectly reasonable for two people with the same 
attitude to behave differently. Unfortunately, 
Doob's major point was not only not accepted, 
but it actually produced something akin to a 
“boomerang effect." That is, in Chein's (1948) 
critique of Doob's article, he essentially thanked 
Doob for pointing out what was wrong with 
our measures and definitions of attitude. In- 
deed, said Chein, Doob has pointed out that 
two people may feel the same amount of affect 
toward an object but may behave differently 
with respect to that object, or they may hold 
different beliefs about what should be done 
with respect to that object, or both. Clearly 
then, because the “action” component of atti- 
tude is different, these people must have differ- 
ent attitudes. Similarly, Chein continued, two 
people may be equally favorable toward the 
object, but they may also have different cogni- 
tions about the object; they may believe differ- 
ent things about it. Here again, these people 
must have different attitudes, An attitude, then, 
has several components. Now, this was not the 
first time that a multicomponent definition was 
advanced, nor was it the last. Indeed, if you 
pick up almost any social psychology textbook, 
you will find that attitude is currently defined 
as a concept containing an affective, a cognitive, 
and a conative component. 

In contrast to this, I prefer, following Thur- 
stone (1931), to view attitude as a relatively 
simple unidimensional concept, referring to 
“the amount of affect for or against a psycho- 
logical object” (p. 261). Rather than viewing 
beliefs and behavioral intentions as a part of 
attitude, I prefer to define them independently 
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and to view them as phenomena that are re- 
lated to attitudes. More specifically, I see beliefs 
and behavioral intentions as determinants or 
consequents of an individual's attitude. As 
Green (1954) has pointed out, the concept of 
attitude is a hypothetical variable, abstracted 
from the many statements and actions that an 
individual makes with respect to a given object. 
Thus, rather than viewing statements about the 
object (i.e., beliefs) and statements about ac- 
tions that one would take with respect to the 
object (i.e, behavioral intentions) as part of 
attitude or as the attitude per se, I feel that 
these statements can best be viewed as indicants 
of an individual's attitude. 

One argument for the unidimensional view- 
point is a pragmatic one. If one accepts a multi- 
dimensional view of attitude, this implies that 
the "attitude" of any one person toward an ob- 
ject may fall at three very different positions 
on three different dimensions. Nevertheless, the 
operations by which attitudes are measured 
almost invariably yield a single score that is 
unlikely to reflect these three different compo- 
nents in any very precise fashion. As a matter 
of fact, people who construct "attitude scales" 
rarely maintain that their instruments are mea- 
suring three components; instead, they usually 
contend that their scales indicate people's evalu- 
ations of, or affect toward, an object or concept. 
Thus, although attitudes are often said to in- 
clude all three components, it is usually on 
evaluation or "the affective component" that is 
measured and treated by researchers as the es- 
sence of attitude. 

Furthermore, there is considerable evidence 
showing that this single "affective" score is 
highly related to an individual's beliefs about 
the object. The research of Rosenberg (1956, 
1960), Zajonc (1954), Fishbein (1963, 1965a, 
19655), and others has demonstrated that an 
individual's attitude (or affect) toward any ob- 
ject is a function of his beliefs about the object 
(i.e, the probability or improbability that the 
object is related to some other object, value, 
concept, or goal) and the evaluative aspects of 
those beliefs (i.e., the evaluation of—or attitude 
toward—the “related concept"). 

In addition, as Fishbein (see pp. 257 to 266 
in this book) has pointed out, à consideration 
of most standardized instruments for measuring 
attitudes will also demonstrate that the single 
"affective" score they obtain is in fact derived 
from a consideration of a subject's beliefs and 
the evaluative aspects of those beliefs. For ex- 


ample, in Thurstone Scaling and Likert Scaling 
the subject is confronted with a series of belief 
statements. In both cases, the attitude score is 
indexed from a consideration of the respon- 
dent's beliefs (i.e, his agreement or disagree- 
ment with each of the statements) or, as Green 
suggests, it is abstracted from several of his 
statements about the attitude object. 

Similarly, the single “affective” score obtained 
from Bogardus’ Social Distance Scales is based 
on a consideration of an individual's behavioral 
intentions and their evaluative aspects. 

Thus, from my point of view, this hypothet- 
ical variable that we call an "attitude" can be 
measured by considering either beliefs or be- 
havioral intentions, or by attempting to get at 
evaluation per se (e.g., through the use of such 
instruments as the Semantic Differential). It 
seems that these different types of instruments, 
or approaches, are attempting to measure the 
same thing; each is attempting to arrive at a 
single score that will represent how favorable 
or unfavorable the individual is toward the 
attitude object in question. 


THE RELATIONSHIPS BETWEEN ATTITUDE 
AND OTHER PHENOMENA 


Taking a unidimensional view of attitude 
does not imply that one should ignore cognition 
and conation. Rather, it implies that beliefs 
and behavioral intentions must be studied in 
their own right, as independent phenomena 
that may be related to attitude and behavior. 
Thus the problem is not simply to investigate 
relationships between attitude and behavior; 
rather, one must be concerned with at least four 
things: attitudes, beliefs, behavioral intentions, 
and behavior. The problem, then, is to investi- 
gate the interrelations among all four of these 
concepts. 

Although considerable progress has been made 
in understanding the relationships between be- 
liefs and attitudes, little has been done in in- 
vestigating the relationships between beliefs and 
behavioral intentions, attitudes and behavioral 
intentions, or any of these three and behavior 
per se. In the remainder of this paper, I should 
like to consider some of these interrelationships. 


The Relations between Beliefs and Attitudes 
(That Is, Cognition and Affect) 
As already mentioned, it appears that we now 


have fairly conclusive evidence that a person's 
attitude toward any object can be seen as a 
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function of his beliefs about the object (i.e., the 
probability or improbability that the object has 
a specific relationship with some other object, 
value, concept, or goal) and the evaluative as- 
pects of those beliefs (i.e., the subject’s attitude 
toward, or evaluation of, the “related object’’).1 

It should be noted, however, that although 
this evidence provides strong support for the 
general hypothesis that an individual's attitude 
toward any object is a function of his beliefs 
about the object and their evaluative aspects, 
it does not imply that any given belief will be 
correlated with the attitude. Indeed, this rela- 
tionship between beliefs and attitudes indicates 
that it is quite probable that an individual 
will have some beliefs that appear "inconsis- 
tent" with his attitude. As Fishbein (pp. 257 to 
266 in this book) has pointed out, "Although 
each belief suggests an attitude, the attitude per 
se can only be reliably abstracted by consider- 
ing the many beliefs an individual holds" 
(p. 264). Thus, while an individual's attitude 
will be highly correlated with an estimate based 
on a consideration of many of his beliefs, it 
may be uncorrelated or even negatively corre- 
lated with any single belief considered in iso- 
lation. Although the implications of this for the 
various "consistency theories" are beyond the 
scope of the present paper, it should be noted 
that this view suggests that it is not necessarily 
"inconsistent" for an individual (a) to have a 
favorable attitude toward some "object" and 
(b) to believe that the "object" has some nega- 
tive characteristics, qualities, or attributes. 

In addition to its implications for "consis- 
tency theories," the strong relationship between 
beliefs and attitudes also suggests one major 
reason why many investigators have failed to 
find relationships between attitudes and behav- 
ior: often the attitude being measured may be 
inappropriate. For example, in many cases an 
individual's attitude toward a class of people 
or objects has been measured, and this attitude 
has been used in an attempt to predict the in- 
dividual's behavior with respect to a particular 
member of that class. 

Thus one frequently measures a subject's at- 
titude toward "Negroes," and then attempts to 
predict his behavior with respect to a particular 
Negro individual. However, it is very unlikely 
that the subject's beliefs about this particular 


1 For two studies providing evidence for this hy- 
pothesis, see pages 325 to 331 and 389 to 400 in 
this book. The algebraic expression of the hypoth- 
esis appears on page 489, 


Negro individual are even similar to his beliefs 
about "Negroes in general." Clearly, if a sub- 
ject were asked to describe a “Negro,” he would 
give a very different set of responses than if 
he were asked to describe Martin Luther King 
or Cassius Clay. Because his beliefs about these 
stimuli would differ, his attitudes toward them 
would also differ. Indeed, probably the only 
reason why even minimal correlations between 
attitude and behavior have been obtained in 
some cases is the fact that one belief about the 
particular stimulus person is that “he is a 
Negro”; this belief, being quite strong, will 
contribute heavily to the attitude. However, 
even this may not occur with respect to other 
stimulus persons who cannot be readily identi- 
fied as members of a given group (e.g., Jews, 
Catholics, etc.). It seems fairly obvious that the 
chances of predicting behavior from attitude 
are practically nil until we at least start measur- 
ing attitudes toward the appropriate stimulus. 

The problem often becomes even more 
complex, however. For example, we are often 
concerned with predicting different types of so- 
ciometric choices. Thus we may try to predict 
which group members a particular member will 
choose as friends or as co-workers. Bales’ (1958) 
research has provided strong evidence that the 
person we may like the best and the person 
we would most like to work with are usually 
two different people. This implies that our 
attitude toward a person as a co-worker may be 
quite different from our attitude toward the 
same person as a friend. This should not be 
surprising; if an individual were asked to de- 
scribe a particular person as a card player, he 
would give a different set of beliefs about that 
person than if he were asked to describe the 
same person as a co-worker. Thus, as a minimal 
initial step in trying to understand the rela- 
tionships between attitudes and behavior, we 
must first start measuring attitudes toward the 
appropriate stimuli, vis-A-vis the behaviors we 
are attempting to predict. 


The Relations between Attitude and Behavioral 
Intentions (That Is, Affect and Conation) 


In a recent series of papers, Triandis (sce 
pp. 208 to 219 in this book) and his associates 
(e.g., Triandis and Davis, 1964; Fishbein, 1964; 
Triandis, Fishbein, Hall, Tanaka, and Shanmu- 
gan, 1967) have been exploring the "behavioral" 
or "conative" component of attitude. In the first 
paper in this series, Triandis developed an in- 
strument called the Behavioral Differential. 
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This instrument consists of a series of behav- 
ioral statements (eg, “accept as an intimate 
friend,” “go fishing with,” “admire the ideas 
of,” etc.) similar to those found on Bogardus’ 
(1925) Social Distance Scales. The respondent is 
asked to indicate (usually on a nine-place scale) 
the extent to which he “would” or “would not” 
engage in these behaviors with respect to a 
given stimulus person. Starting with a large set 
of behaviors obtained from a content analysis 
of novels, Triandis initially selected behavioral 
items on the basis of a facet analysis, and fur- 
ther reduced the number of items through fac- 
tor analyses, Generally speaking, factor analyses 
of a large number of item-by-item intercor- 
relation matrices (obtained in several different 
studies, utilizing different behavioral items and 
different stimuli) have indicated five basic clus- 
ters of items. That is, items always seem to 
cluster together into the five types shown in 
Table 1. 

It should be noted that these five types of 
intentions have not been referred to as different 
dimensions of behavioral intentions, but only 
as clusters or types. Depending largely on the 
types of stimulus persons that are rated, these 
different clusters tend to go together in different 
ways (i.e, they may load on the same or differ- 
ent dimensions in any given factor analysis). 
For example, in one study, admiration and 
friendship items may load on one factor, while 
social distance and subordination intentions 


may load on a second factor. In another study, 
admiration and subordination intentions may 
load on one factor, while friendship and social 
distance intentions may load on a second factor. 
Thus, although Triandis has essentially identi- 
fied five different general types of behavioral 
intentions, it should be made clear that these 
five types are usually not independent and, fur- 
thermore, that their relations to one another 
depend largely on the type of stimulus person 
being rated. 

Turning to the question of the relations be- 
tween these five types of behavioral intentions 
and attitude (i.e., affect), it should first be noted 
that because these behavioral intentions inter- 
correlate differentially with one another, they 
also are differentially related to attitude. That 
is, in one study attitude may be more highly 
correlated with friendship intentions than with 
social distance intentions, while in another 
study the findings may be reversed. Generally 
speaking, however, the results of several studies 
seem to indicate that attitudes are most highly 
correlated with admiration and friendship in- 
tentions (r=.55), are moderately correlated 
with subordination and social distance inten- 
tions (r=.35), and are least correlated with 
marital intentions (r — .15). 

While the correlations between attitude and 
the different types of behavioral intentions vary 
considerably, the correlation between attitude 
and the sum of the behavioral intentions tends 


TABLE 1. Item Clusters in Behavioral Differential 
aS 


Typical Items 


Cluster 
l. Marital I would: go on a date with 
fall in love with 
marry 
2. Admiration I would: admire character of 
believe 
admire idea of 
praise suggestions 
, ial di e I would: invite to my club 
"ums exclude from my neighborhood 
accept as a close kin by marriage 
4. Friendship I would: accept as an intimate friend 
treat as equal 
eat with 
5. Subordination- I would: be commanded by 
superordination elect to political office 


treat as a subordinate 
work for 


S 
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to be quite stable and high (F = -10). Thus, just 
as any given belief may be uncorrelated with 
an individual's attitude, any given behavioral 
intention (e.g. "I would elect this person to 
political office") or any one type of behavioral 
intention (eg. subordination-superordination 
intentions) may also be uncorrelated with an 
individual's attitude. However, when one con- 
siders the totality (or at least a large set) of 
an individual's behavioral intentions, good esti- 
mates of attitude can be obtained. We shall 
return to this point shortly; first, however, it is 
worth noting that it is with respect to behav- 
ioral intentions (and behavior per se) that 
Doob's argument seems most appropriate. 

More specifically, if we consider what has 
been happening in the South, we can get a 
clear picture of what Doob was referring to. 
That is, the Civil Rights Act of 1964 and var- 
ious judicial decisions have had a profound ef- 
fect on some behavioral intentions and behav- 
iors of many Southerners. Many individuals in 
the South, if asked today, would say that they 
would admit Negroes to their hotels, motels, or 
restaurants. (And, indeed, Negroes have been 
admitted to various places that previously had 
refused to serve them.) I doubt very much, how- 
ever, if any of us would really want to say that 
these individuals have thereby changed their 
attitudes toward Negroes. 

As Doob pointed out, however, behavioral in- 
tentions (as well as overt behaviors) can be 
either positively or negatively reinforced. If our 
initial behavioral intentions or behaviors were 
always positively reinforced, it is likely that 
there would always be a clear relationship 
between attitude and behavioral intentions. 
Behavioral intentions are not always positively 
reinforced; indeed, they are often negatively 
reinforced. Furthermore, we are often positively 
reinforced for having behavioral intentions that 
are inconsistent with our attitudes. Once again, 
I think we can all see many examples of this 
in the South today. 

Before going any further, one point should 
be made clear. Namely, although everything 
that has been said about the relationship be- 
tween attitude and behavioral intentions is also 
true of the relationship between attitude and 
behavior per se, this does not imply that there 
is any given relationship between behavioral 
intentions and behavior. As we shall see below, 
the size of the correlation between a behavioral 
intention and behavior is determined largely 
by the specificity of the behavioral intention 


that is being considered. Indeed, everything 
that has been said about the relationships be- 
tween attitudes and behavioral intentions may 
also apply to the relationships between Trian- 
dis five types of behavioral intentions and 
overt behaviors. 

This is important, for I do not think a 
change in a behavioral intention leads neces- 
sarily to a change in attitude unless the change 
in behavioral intention is accompanied by a 
change in behavior per se. That is, a change 
in behavior implies a new set of relationships 
between the individual and the attitude object. 
This may then lead to the learning of new 
beliefs about the attitude object, and thus to 
attitude change. These beliefs will not be 
learned overnight, however, and the change in 
attitude may lag well behind the change in 
behavior. 

Thus, although an individual's attitude 
might initially influence and be related to spe- 
cific behavioral intentions (or to specific be- 
haviors, this relationship may or may not 
persist, depending on the nature and schedule 
of reinforcement associated with the behav- 
ioral intentions. Furthermore, unlike a change 
in belief that immediately produces an attitude 
change, changes in a behavioral intention may 
not produce a change in attitude unless the 
change in behavioral intention eventually 
leads to changes in behavior and thus changes 
in belief about the attitude object. 

"Therefore, rather than being viewed as parts 
of attitude, behavioral intentions should be 
viewed as independent phenomena. Just as a 
consideration of beliefs (or the cognitive com- 
ponent) yields a single score that indicates how 
favorable or unfavorable an individual is with 
respect to the attitude object, so, too, does a 
consideration of behavioral intentions (or the 
conative component). That is, as was mentioned 
earlier, an individual's attitude toward any ob- 
ject may be assessed by measuring affect or 
evaluation per se, by considering his beliefs or 
his behavioral intentions, or some combination 
of all of these. 

It appears, then, that the multicomponent 
view of attitude is misleading and inappropri- 
ate. Indeed, in contrast to the major criticism 
leveled at the unidimensional view of attitude, 
this single "affective" score does, in a sense, 
take into account and reflect an individual's 
beliefs and his behavioral intentions. Although 
two people may have the same attitudes, how- 
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ever, these attitudes may be based on different 
beliefs or may be related to different behavioral 
intentions. As the multicomponent theorists 
have pointed out, this may well be one of the 
major reasons why attitudes are uncorrelated 
with behavior. But here I would argue that it 
is not because the attitude measure is inappro- 
priate or incomplete. Rather, it is because atti- 
tude is a hypothetical variable abstracted from 
the totality of an individual's beliefs, behavioral 
intentions, and actions toward a given object. 
Any given belief, behavioral intention, or be- 
havior, therefore, may be uncorrelated or even 
negatively correlated with his attitude. Thus, 
rather than viewing specific beliefs or classes of 
beliefs and specific behavioral intentions or 
types of behavioral intentions as part of atti- 
tude, these phenomena must be studied as vari- 
ables in their own right, which, like attitudes, 
may or may not function as determinants of a 
specific behavior. 


The Relationships between Attitude and 
Behavior (That Is, Affect and Behavior) 


On this note we may finally turn our full 
attention to the presumed topic of this paper. 
First, however, I want to make it clear that 
when I use the term "attitude," I am simply 
referring to a learned predisposition to respond 
to any object in a consistently favorable or un- 
favorable way. Operationally, I do not care 
whether we have obtained our estimate of atti- 
tude through a consideration of affect per se, 
through a consideration of beliefs and their 
evaluative aspects, or through a consideration of 
behavioral intentions and their evaluative as- 
pects. That is, I am talking about a single score 
that places an individual along a continuum 
ranging from favorableness toward the attitude 
object to unfavorableness. Let us further assume 
that this measure is reliable and valid (at least 
insofar as tapping a true underlying dimension 
of favorableness or unfavorableness toward the 
attitude object is concerned). 

Thus far, I have indicated two possible rea- 
sons for our failure to predict behavior from 
attitude: 


1. We have often measured attitude toward 
an inappropriate stimulus object—for example, 
we have often measured attitude toward a class 
of people or objects when we should have been 
measuring attitude toward a particular member 
of the class. 

2. The particular behavior being studied 


may be completely or partially unrelated to 
attitude. This point must be emphasized be 
cause most investigators of attitude have been 
unwilling to accept it. To a large extent, how- 
ever, their unwillingness to accept this state 
ment is surprising since most investigators firmly 
believe that any behavior is determined by a 
large number of variables. Yet time and again, 
a behavior is investigated because the experi- 
menter assumes that it should be a function of 
attitude and then he is surprised and disap- 
pointed to find that his measure of attitude 
failed to predict the behavior. 


Unfortunately, one of the most common re- 
sponses to this failure has been to question the 
attitude measure; it is criticized for measuring 
only affect and not taking into account the so- 
called cognitive or conative dimensions of atti- 
tude. What one should be questioning however, 
is the initial assumption that the behavior being 
considered is a function of the attitude being 
measured. Furthermore, even if measures of spe- 
cific sets of beliefs or types of behavioral inten- 
tions were available, the chances would still be 
high that the behavior could not be predicted. 
That is, it must be realized that the appeal to 
beliefs and behavioral intentions may be based 
on the same false assumption that led to the 
expectation of an attitude-behavior relationship 
—namely, the assumption that the behavior 
being investigated is a function of some type of 
belief or behavioral intention. 

Granted, this assumption may be true in some 
cases, in most cases however, it is quite likely that 
the particular beliefs or behavioral intentions 
we would choose to measure would themselves 
be highly correlated with attitude. Thus they 
would add little, if anything, to our predictive 
ability. The only times when it may be useful 
to consider beliefs and behavioral intentions (in 
addition to attitude) are those situations in 
which the particular beliefs or behavioral in- 
tentions that we wish to consider are not them- 
selves highly correlated with the individual's 
attitude. However, even in those cases we must 
be willing to recognize that the beliefs or be- 
havioral intentions that we do select may also 
be unrelated to the behavior itself. That is, just 
as we must be willing to accept the fact that 
two people with the same attitudes may behave 
differently, so, too, must we be willing to accept 
the fact that two people with the same beliefs 
or two people with the same behavioral inten- 
tions may also behave quite differently. 
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BEHAVIORAL PREDICTION AND MEASURES 
OF BEHAVIOR 


If you will notice, however, I have stopped 
discussing the relations between attitudes and 
behavior and started discussing the general 
problem of behavioral prediction. Essentially 
what I have been pointing out is how we might 
predict behavior in those situations where atti- 
tude is not a relevant variable. More specifically, 
I have indicated that when most attitude re- 
searchers fail to find a relation between attitude 
and behavior, they usually suggest that if we 
were to take one or two other variables into 
account, we might be able to predict the 
behavior. Although this is reasonable if our pri- 
mary concern is to predict behavior—and this 
is what I believe our primary concern really is 
—it is unfortunate that two of the variables 
most commonly suggested (i.e. belief and be- 
havioral intention) are arrived at through a 
consideration of the predictor, rather than 
through a consideration of the behavior per se. 
Here, I think, is where our major problem lies: 
we psychologists have never really studied behav- 
ior per se. By this I mean that we have usually 
taken behavior as a given; to the best of my 
knowledge, we—at least in the attitude area— 
have seldom, if ever, subjected our behavioral 
criteria to the same rigorous analyses to which 
we subject our paper-and-pencil tests. Yet this 
is what we must do if we are to thoroughly un- 
derstand the relations between attitudes and 
behavior (ie. if we are ever going to predict 
the conditions under which, and the extent to 
which, behavior is determined by, or related to, 
attitudes—or to any other variable, for that 
matter). In the remainder of this section, I shall 
try to clarify this by discussing a hypothetical 
example. 

First, let me describe a very simple behavioral 
situation. A naive subject shows up for an ex- 
periment and is asked to wait in an adjoining 
room while the experimenter sets up the ex- 
periment. The adjoining room is fairly small, 
containing only four chairs, arranged so that 
they form the four corners of a square, Seated 
in one of the chairs is a confederate of the ex- 
perimenter, When the naive subject enters the 
room, the confederate looks up and says, "Hello, 
my name is Robert Springer." The two peo- 
ple then remain in the room for a five-minute 
period. 

Now let us suppose that, as the experimenter, 
I want to make an hypothesis about the sub- 


ject's communicative behavior. Furthermore, let 
us assume that I have a highly valid and reli- 
able instrument for measuring the number, 
duration, or type of communicative acts that 
the subject makes. Finally, let us also assume 
that I have a well-trained confederate who be- 
haves in a highly standardized manner. 

Now, to return to my hypothesis, let us sup- 
pose I have a strong hunch that the more the 
subject likes the confederate, the more he will 
talk to him. Thus I have essentially made the 
assumption that the amount of the subject's 
communicative behavior is a function of his 
attitude. To put it another way, I have implic- 
itly made the assumption that this behavior is 
a good indicator of the subject's attitude toward 
the confederate. "That is, rather than viewing 
the behavior as a criterion, I can also view it 
as a measure of the variable I assume it reflects. 
In this case, then, I would essentially be es- 
timating the subject's attitude toward the 
confederate through a consideration of his 
communicative behavior. 

Once we start considering the behavioral 
measure as an estimate of attitude (or any other 
variable), we can start subjecting the behavior 
to the same types of critical analyses to which 
we subject our paper-and-pencil measures. For 
example, one of the most obvious questions that 
should be asked concerns the reliability of the 
behavior. That is, even in the simple situation 
I have described, we could bring the subject 
back at the same time on another day and ob- 
tain a measure of test-retest reliability. Even 
this simple test provides us with a considerable 
amount of new information; if the behavior is 
unreliable, that finding almost immediately 
suggests that this behavior is not a direct func- 
tion of any variable that is reliable or has not 
changed from Time 1 to Time 2. “Unreli- 
ability" provides a clear indication that the 
behavior is not determined directly by any of 
the controlled variables that are part of our 
experimental situation, whether they can be 
specified or not. More specifically, "unreliabil- 
ity" allows us to conclude that this behavior is 
neither situationally determined, nor solely a 
function of any factor that is consistently re- 
lated to or based upon a consideration of the 
stimulus person, that is, the confederate.? 

If the behavior is shown to be reliable, this 


2 Obviously, a change in behavior could be a func- 
tion of a change in attitude. This example, how- 
ever, is only concerned with the case where attitude 
does not change. 
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knowledge permits us to go on to the more 
important question of validity. Obviously, I am 
not talking about the validity of the behavior 
per se (for, by definition, behavior is "real"), 
but rather about the validity of the behavior as 
an indicant or measure of some other variable. 

Here, rather than starting with the usual 
question of convergent validity (i.e, the extent 
to which two different estimates or measures of 
a given variable are correlated), I should like 
first to discuss briefly what Campbell and Fiske 
(1959) have called discriminant validity. Accord- 
ing to these authors, a valid test should not only 
measure what it is supposed to measure, but also 
it should not measure anything else. To quote 
them, “tests can be invalidated by too high 
correlations with other tests from which they 
were intended to differ” (p. 81). 

Furthermore, they point out that every test 
or task employed for measurement purposes 
should be viewed as a trait-method unit, that is, 
as "a union of a particular trait [or variable] 
content with measurement procedures not spe- 
cific to that content." Thus, they continue, “The 
systematic variance among test scores can be 
due to responses to the measurement features 
as well as responses to the trait content" (p. 81). 

To return to our simple situation, it can be 
seen that what I have described is a general, 
content-independent measurement procedure. 
Essentially, I have described a behavioral 
method for measuring a subject's attitude to- 
ward any given stimulus person. Variations in 
the characteristics or behavior of the stimulus 
person (or the situation per se) are analogous, 
then, to what Campbell and Fiske have called 
variations in trait content. 

Continuing, Campbell and Fiske point out 
that "In order to examine discriminant valid- 
ity, and in order to estimate the relative con- 
tributions of trait and method variance, more 
than one trait as well as more than one method 
must be employed in the validation process” 
(p. 81). ‘Thus, in contrast to our usual valida- 
tion technique of correlating some pencil-and- 
paper measure of attitude with one or more 
behavioral criteria (that we assume or hypoth- 
esize to be correlated with attitude), we should, 
at a minimum, be obtaining measures of atti- 
tude toward at least two stimulus objects and 
measuring the subject's behavior with respect 
to each of them. It is only in this way that we 
can start to obtain some indication of a more 
general pattern of relationships between atti- 
tude and behavior. That is, we must start to 


pay more attention to the question of discrimi- 
nant validity if we ever hope to distinguish 
between those behaviors that are functions of 
the situation and those that are related to char- 
acteristics associated with the attitude object. 

Although a complete discussion of Campbell 
and Fiske's multitrait, multimethod matrix 
technique is beyond the scope of the present 
paper, it appears that this technique (with some 
modifications) provides a simple and straight- 
forward procedure for beginning to analyze the 
relationship between attitude and behavior. 
Because Campbell and Fiske's original concerns 
were with the measurement of personality traits 
(which are usually assumed to be independent), 
at least one modification in interpretation 
should be mentioned. Specifically, the Campbell 
and Fiske analysis suggests that when the same 
method is used to measure two or more traits 
and the monomethod, multitrait correlations 
are high, a large (and irrelevant) "methods" 
factor may be influencing the responses. This is 
not necessarily the case in attitude research, be- 
cause subjects’ attitudes toward two or more 
stimuli may often be highly intercorrelated. 

While this relationship does not require 
change in any of the criteria that Campbell and 
Fiske propose for analyzing discriminant valid- 
ity, it does suggest that one alternative criterion 
may be useful in considering discriminant va- 
lidity in attitude research. Specifically, varia- 
tions in the attitude object should produce 
reliable variation in the attitude scores. For 
example, although individuals' attitudes toward 
two stimuli may be highly correlated (e.g., au- 
thoritarian subjects’ attitudes toward Negroes 
and Jews), the method used to measure these 
attitudes would have discriminant validity with 
respect to the two stimulus persons if, in addi- 
tion to Campbell and Fiske's criteria, one found 
a reliable mean difference between the two atti- 
tude scores, that is, if the subjects’ attitudes 
toward Negroes were significantly different from 
their attitudes toward Jews. A similar argument 
applies to the behavioral measures. This may 
be seen more clearly below. 


THE RELATIONSHIP OF ATTITUDE AND 
BEHAVIOR AS SEEN IN A MULTIATTITUDE 
OBJECT, MULTIMETHOD MATRIX 


To return to the main point, it appears that 
in those cases where the attitudinal and behav- 
ioral measures are uncorrelated (ie, where 
there is no convergent validity), the multitrait 
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(or multiattitude object), multimethod approach 
provides a considerable amount of information 
that would not be obtained in other ways. This 
is not to say that one will not obtain conver- 
gent validity between some attitudes and some 
behaviors, nor that these findings are unimpor- 
tant or uninformative. However, the Campbell 
and Fiske approach is probably most useful in 
respect to the further leads it provides in those 
cases where convergent validity is not obtained. 
This may perhaps be seen most clearly if we 
discuss some of the major cases that are possible. 


Case A: Discriminant Validity in the Attitu- 
dinal, but not in the Behavioral Measures 


Suppose that in our hypothetical situation 
we find that, irrespective of variations in the 
characteristics or behaviors of our confederate, 
the behavior in this situation does not vary. 
That is, suppose the results indicate that the 
number of communicative acts do not vary 
whether the confederate is Negro or White, 
male or female, Catholic or Protestant, dressed 
neatly or sloppily; or whether the confederate 
is instructed to agree or disagree with the sub- 
ject, to respond with long sentences or short 
phrases, etc. In other words, suppose we find 
that, at least with respect to the variables we 
are manipulating, our behavioral measure has 
no discriminant validity. Let us further suppose 
that, in contrast, our paper-and-pencil measure 
does reflect these differences, that is, that as we 
manipulate certain characteristics of the con- 
federate, the subject's attitudes toward the con- 
federate change. Thus our attitude measure does 
have discriminant validity. Finally, let us assume 
that this particular paper-and-pencil measure 
has a high degree of convergent validity with 
respect to other measures of attitude, that is, 
that it has construct validity. 

Clearly, then, our failure to obtain conver- 
gent validity with the behavioral measures in 
this situation does not imply that our measure 
of attitude is invalid; rather, it strongly indi- 
cates that this behavior is not sensitive to or not 
related to the attitude we have measured—nor 
to any variable that is likely to correlate with 
either the attitude or any of the characteristics 
or actions that were manipulated. Indeed, rather 
than concern ourselves with attitudes, we can 
be fairly certain that this behavior is either a 
function of some stable individual difference 
variable or some nonstimulus (i.e, nonconfed- 


erate) associated variable or variables in this 
situation. 


Case B: Discriminant Validity in the Behav- 
ioral, but not in the Attitudinal Measures 


Alternatively, of course, the lack of conver- 
gent validity could derive from the finding that 
variations in the stimulus object produce varia- 
tions in the behavior but not in the paper-and- 
pencil measure. In this case our behavioral mea- 
sure would have discriminant validity, but, at 
least with respect to the particular variables we 
have manipulated, our paper and pencil mea- 
sure would not. Furthermore, let us again as- 
sume that the particular paper-and-pencil test 
does have a high degree of construct validity. 
This is an interesting case, for here we have a 
change in behavior that is related to the manip- 
ulation of a specified variable but is not corre- 
lated with the attitude measured. Here again, I 
would argue that our failure to obtain conver- 
gent validity does not imply that our measure 
of attitude is invalid; it would only indicate 
that this behavior is not related to the attitude 
measured, nor to any variable that is likely to 
correlate with the attitude measured. Indeed, 
what we have found is that certain variables 
that function as determinants of behavior with 
respect to a given stimulus person are unrelated 
to attitude toward that person. Thus, if our 
main interest is in predicting the behavior, we 
should concentrate on measuring those partic- 
ular variables rather than on measuring atti- 
tudes toward the person. 

Before continuing, I should mention one 
other possible reason for not obtaining conver- 
gent validity in this case, namely, it could be 
that we have measured attitude toward an in- 
appropriate stimulus object. For example, E 
measured attitudes toward "Catholics" and 
“Protestants” rather than toward the specific 
Catholic individual and the specific Protestant 
individual who served as confederates. The fail- 
ure to find discriminant validity in our attitude 
measure argues against this, however. That is, 
inappropriate stimuli are likely to be fairly 
general or stereotypic, and because our instru- 
ment has construct validity, this type of error 
would increase, rather than decrease, the degree 
of discriminant validity. Although this type of 
error is not indicated in the present case, it is 
indicated in the final set of alternatives I shall 
discuss. 


Case C: Discriminant Validity in Both the At 
titudinal and the Behavioral Measures 


Consider the case where both our behavioral 
and our pencil-and-paper measures have discrim- 
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inant validity but are not correlated with each 
other. Again, this lack of convergent validity 
does not imply that our measure of attitude is 
invalid. Here, however, the alternatives are two- 
fold: (1) as mentioned above, this may well be a 
situation in which we have measured attitudes 
toward an inappropriate attitude object; (2) this 
is also the kind of situation where we may want 
to look for some other variable that is some- 
what related both to our attitude measure and 
to our manipulations. That is, this is the type 
of situation in which variables such as beliefs 
or behavioral intentions are truly likely to in- 
crease our ability to predict behavior. 

Obviously, there are many other alternatives 
that may be differentiated when one uses a 
multiattitude object, multimethod approach 
(e.g., both attitude and behavioral measures 
may have low discriminant validity; convergent 
validity may be obtained with only one set of 
the attitude and behavioral measures). Although 
a complete discussion of these alternatives is 
beyond the scope of the present paper, it is 
worth noting that each alternative carries dif- 
ferent implications for an understanding of the 
attitude-behavior relationship. In contrast to 
this, the usual procedure of obtaining a single- 
attitude measure and attempting to predict one 
or more behaviors provides us with little or no 
information in those cases where convergent 
validity is not obtained. That is, the failure to 
find convergent validity in such cases can be 
interpreted in many ways. As was pointed out 
above, the most common interpretation has 
been that there was something wrong with the 
measure of attitude (e.g., it was inappropriate; 
it was only a measure of affect and did not 
take the cognitive and the action component 
into account). However, as was also pointed 
out above, this lack of convergent validity can 
also be interpreted as indicating that the be- 
havior under consideration is unrelated to the 
attitude that was measured. More specifically, 
the multiattitude object, multimethod approach 
clearly indicates that many of an individual's 
behaviors with respect to a given object are 
likely to be primarily under the control of vari- 
ables other than the individual's attitude toward 
that object. In particular, it emphasizes the 
importance of the situation as a factor deter- 
mining behavior. 

Although many investigators have often rec- 
ognized the importance of situational and 
"other" variables as determinants of behavior, 
most treatments of attitude have not dealt with 


these factors explicitly. Rather, they have usu- 
ally been viewed as sources of "error" variance. 
The conception of attitude as a “predisposition 
to respond” has generally led to the assumption 
that an individual's behavior with respect to 
some object is a direct function of his attitude 
toward the object, and “other” factors are 
viewed as "noise" in the system. In the final 
section of this paper, a theory will be presented 
that attempts to deal with this larger constella- 
tion of behavioral determinants within a single 
formulation of the attitude problem. Rather 
than starting with the assumption that there is 
a strong underlying relationship between atti- 
tude and behavior, the theory to be presented 
emphasizes the importance of situational, mo- 
tivational, and normative variables as factors 
influencing overt behavior. 


THE ATTITUDE-BEHAVIOR RELATIONSHIP: 
A REANALYSIS 


The theory to be presented below can best 
be seen as an adaptation of Dulany's (1961, 
1964) theory of propositional control. Although 
Dulany's theory has largely been developed 
within the context of studies of verbal condi- 
tioning and concept attainment, it is essentially 
a theory that leads to the prediction of overt 
behavior. It is hoped that the present approach 
will provide a more complete understanding 
of the relationships between attitudes and be- 
havior, by identifying some of the ways in 
which other variables interact with attitudes 
as determinants of overt behavior. More spe- 
cifically, the present approach may be viewed 
as an attempt to bridge the gap between tradi- 
tional measures of attitude toward a stimulus 
object and behavior with respect to that object 
in a given situation. 

As mentioned above, Dulany's theory has 
largely been developed within the context of 
studies of verbal conditioning and concept 
attainment. More specifically, Dulany has been 
concerned with predicting the probability with 
which an individual will make a particular 
verbal response or class of verbal responses. 
The central equation of the theory can be ex- 
pressed as follows: 


BI = ((RHd)(A)]w, + [(BH)(Mc)yw; 


where 
BI — the subject's intention to make a par- 
ticular response or class of responses 
RHd — a "hypothesis of the distribution of 
reinforcement," that is, the subject's 
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hypothesis that the occurrence of the 
particular response will lead to a cer- 
tain event or class of events 
A z the affective value of the reinforce- 

ment, that is, the subject's evaluations 
of (or attitudes toward) those events 

BH = the subject's “behavioral hypothesis,” 
that is, his belief as to what he is 
expected to do, or what he should do 
in the situation 

Mc = the subject's “motivation to comply,” 
that is, how much the subject wants 
to do what he believes is expected of 
him 


Two additional points about the theory 
should be mentioned: 


1, According to the theory, behavioral in- 
tention is the immediate antecedent of overt 
behavior. Unlike the general types of behav- 
ioral intentions that most attitude researchers 
(eg. Triandis, 1964; Triandis, Fishbein, and 
Hall, 1964) have been concerned with (e.g., 
intentions to marry, to show social distance, 
to subordinate, to accept as a friend), Dulany 
has been concerned with a more precise and 
specific type of behavioral intention—namely, 
an individual's intention to perform a given 
action in a given situation. 

"That is, it will be recalled that in order to 
obtain a measure of an individual's behavioral 
intention (to show social distance, for example), 
Triandis would ask the subject to indicate 
whether he (ie, the subject) "would" or 
"would not" engage in a specific set of behav- 
iors (eg. “invite to my club," "exclude from 
my neighborhood," "accept as a close kin by 
marriage," etc.) with a given stimulus person. 
Triandis would then sum the responses to 
these items, and this sum would be taken as 
the measure of the respondent's behavioral 
intention (to show social distance). In contrast 
to this, Dulany has been concerned with mea- 
suring the subject's intention to perform the 
specific behavior he is interested in predicting. 
Furthermore, unlike Triandis, Dulany would 
not ask the subject to indicate his intention 
to engage in the act "in general" (e.g, mea- 
sure his intention to "obey" the stimulus per- 
son), but rather he would measure the subject's 
intentions to engage in the act in a particular 
situation. Because of this close correspondence 
between the measure of the behavioral inten- 


tion and the actual behavior that Dulany wants 
to predict, the correlation between the measure 
of behavioral intention and the actual overt 
behavior is almost perfect (i.e., the correlations 
are always in the mid .90's. Thus if one can 
predict the specific behavioral intention, one 
may, with only slightly attenuated accuracy, 
predict the overt behavior. It must be empha- 
sized, however, that these near-perfect corre- 
lations between behavioral intentions and 
behavior are only obtained, and are only ex- 
pected, when one considers an individual's 
intention to perform a specific act in a specific 
situation, The more abstract or generalized 
the intention becomes, the lower will be its 
correlation with a specific behavior. 

2. As can be seen above, the algebraic ex- 
pression of the theory takes the form of a lin- 
ear multiple regression equation. That is, 
[(RHd)(A)) is viewed as one component influ- 
encing behavioral intentions and [(BH)(M«c)] 
is seen as a second component. The precise 
weights to be given these two components as 
determinants of behavioral intentions within 
a given situation may be determined by stan- 
dard multiple-regression procedures. 


A liberal interpretation of this theory as 
applied to social behavior may be given as 
follows. An individual's intention to perform 
a specific act, with respect to a given stimulus 
object, in a given situation, is a function of 
the following: 


la. His beliefs about the consequences of 
performing a particular behavior (in a 
given situation), that is, the probability 
or improbability that the performance 
of behavior X will lead to some conse- 
quence y; (B,). 

1b. The evaluative aspect of B, that is, the 
S's evaluation of y, (a). 

2a. A normative belief, that is, the S’s belief 
about what he should do in this situa- 
tion (NB). 

2b. His motivation to comply with the 
norm, that is, his desire, or lack of de- 
sire, to do what he thinks he should do 
(Mo). 


Thus in the present adaptation of Dulany's 
theory, RHd (the hypothesis of the distribution 
of a reinforcer) is conceptualized as being anal- 
ogous to an individual's beliefs about the con- 
sequences of performing a specific behavior, 
and 4 (the affective value of the reinforcer) 
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is conceptualized as being equivalent to the 
evaluative aspects of those beliefs. It is inter- 
esting to note that this conceptualization re 
defines the first component of Dulany's theory 
((RHd)(A)] as a measure of attitude. That is, 
as was mentioned above, the work of Rosen- 
berg (1956, 1965), Zajonc (1954), Fishbein 
(1963, 19655, 1967a, 1967b), and others has 
provided strong evidence that an individual's 
attitude. toward any object can be predicted 
from a knowledge of the individual's beliefs 
about the object and the evaluative aspects of 
those beliefs. Algebraically, this may be ex- 
pressed as follows: 


* 
mon 
i=l 
where 


A, = the attitude toward some object “o” 
B, = belief i about o, that is, the probability 
that o is related to some other object x, 
a, = the evaluative aspect of Bọ that is, the 
respondent's attitude toward x, 
n — the number of beliefs 


It must be emphasized, however, that the 
attitude under consideration is an attitude 
toward performing a given behavioral act, and 
is not an attitude toward a given object, per- 
son, or situation. That is, the algebraic for- 
mula presented above leads to the hypothesis 
that an individual's attitude toward any object 
is a function of the individual's beliefs about 
that object. In the present analysis, we are con- 
cerned with an individual's beliefs about the 
performance of a given behavioral act, and thus 
the attitude being assessed is the individual's 
attitude toward the performance of that act. 
More specifically, from the point of view of 
Dulany's theory, we should be assessing the 
individual's beliefs about what will happen if 
he performs behavior X with respect to stimu- 
lus Y in situation Z, and the evaluative aspects 
of those beliefs, that is, we should be assessing 
the individual's attitude toward the perfor- 
mance of a given act, with respect to à given 
stimulus object, in a given situation. Although 
these beliefs may vary considerably as a func- 
tion of the stimulus object (e.g, the person) 
toward which the act is directed and the situ- 
ation in which the act is to occur (e.g, in 
public or in private), these beliefs are still 
beliefs about the performance of the act, and 
not beliefs about the stimulus object or the 
situation. We shall return to this point below. 


Turning to the second of Dulany's 
theory ((BH)(Me)}, it cam be seen that BH (the 
behavioral hypothesis) has been 
as a normative belief, that is a belief about 
what should be done in the situation. More 
specifically, this is a belief about whether the 
particular act should or should mot be per- 
formed. Here, however, it may be necemary 
to distinguish between two types of normative 
beliefs: (1) the individual's belief about what 
he personally feels he should do (ie, a per 
sonal norm or rule of behavior); and (2) the 
individual's belief about what "society" (Le, 
most other people, his "significant. others," 
etc.) "says" he should do (i.e, a social or group 
norm). Although an individual's personal 
norms will often coincide with, or reflect, the 
social norms, this is not necessarily the case. 
For example, while social norms concerning 
interracial relations are markedly different in 
some Northern and Southern communities, a 
given individual in a Southern community may 
hold the same personal normative belief as an 
individual in a Northern community. Thus it 
may be necessary to take both of those types 
of normative beliefs into account. This can 
easily be done, for as was mentioned above, 
the central equation of Dulany's theory takes 
the form of a multiple regression equation. 
Thus, as Dulany has pointed out, this theory 
is an open theory, and additional components 
can be taken into account. We shall discuss 
this more fully below. 

The final element in the equation (Mc—an 
individual's motivation to comply) is self- 
explanatory. Here we are concerned with the 
degree to which the individual "wants" to com- 
ply with the norm. Clearly, if two types of 
norms are considered, it will be necessary to 
measure the individual's motivation to comply 
with each of them. 

Returning to the central hypothesis, it can 
be seen that in its adapted form Dulany's 
theory essentially leads to the prediction that 
an individual's intention to perform any be- 
havior (and his actual performance of the 
behavior) is a function of (1) his attitude to- 
ward performing the behavior in a given situ: 
ation, and (2) the norms governing that 
behavior in that situation and his motivation 
to comply with those norms. Algebraically, this 
can be expressed as follows: 


B ~ BI = [As] + (NB)MO]w, 


However, as was discussed above, it is not clear 
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whether the second component [(NB)(Mc)] can 
best be viewed as referring to a personal norm, 
a social norm, or both. The simplest way to 
handle this problem within the framework of 
the theory is to expand the algebraic formu- 
lation as follows: 


B ~ BI = [Av uo + [(NB,)(Mc,)]w, 
+ [(VB,)(Mc,)]w2 


where the subscripts p and s refer to personal 
and social norms respectively.? 

Thus, as mentioned previously, the present 
approach suggests a radical change in investi- 
gating and understanding the attitude-behavior 
relationship. Instead of assuming some under- 
lying relationship between an individual's atti- 
tude toward a given object and his behavior 
with respect to that object, the proposed theory 
recognizes the importance of situational vari- 
ables, norms, and motivation as factors influ- 
encing behavior. Rather than viewing attitude 
toward a stimulus object as a major determinant 
of behavior with respect to that object, the 
theory identifies three kinds of variables that 
function as the basic determinants of behavior: 
(1) attitudes toward the behavior; (2) norma- 
tive beliefs (both personal and social; and 
(3) motivation to comply with the norms. Fur- 
thermore, although the theory suggests that 
other variables can also influence behavior, it 
indicates that these other variables operate 
indirectly, that is, by influencing one or more 
of the three basic determinants. 

Thus under certain conditions an individual's 
attitude toward a stimulus object may be re- 
lated to his behavior with respect to that object. 
For example, it is quite possible that an individ- 
ual's beliefs about the consequences of perform- 
ing some behavior (X) will be quite different if 
the behavior is to be performed with respect 
to someone he likes (p,) than with respect to 
someone he dislikes (p,). If this were the case, 
we would expect the individual to have a 
different attitude toward performing X with 
respect to f,, than toward performing X with 


5 An alternative formulation can be seen fol- 
lows: c» 


BI = [A lwo + pa nnana, | wi 


d-1 
This formulation suggests that it may be necessary 
to consider many different types of normative be- 
liefs; for example, beliefs about what one's (a) 
parents, (b) friends, (c) co-workers, (d) religious 
group, etc., "says" the individual should do. 


respect to f». In other cases, however, the con- 
sequences of performing a given act may not 
vary with variation in p, and here attitude 
toward performing X may not change. In the 
former case, it is possible that we would obtain 
a correlation beween attitude toward p and 
behavior with respect to p, while in the latter 
case this is unlikely. 

Similarly we might also expect an individ- 
ual's attitude toward a given stimulus person 
to influence his motivation to comply with a 
norm. That is, if we assume that an individual 
believes that he should perform behavior X 
with respect to individual p in situation X and, 
further, if we assume that the individual views 
behavior X as a behavior that indicates friend- 
ship (e.g. "invite to my club"), then it seems 
likely that the more positive the individual's 
attitude is to p, the more he will want to 
comply with the norm. Similarly, the more neg- 
ative his attitude is to p, the less he may want 
to comply with the norm, that is, the less he 
may want to behave in “a friendly manner" 
even though he believes he should. Here again, 
however, it must be recognized that there will 
be cases where the individual's motivation to 
comply may be completely unrelated to his 
attitude toward p. 

Along these same lines, it can be seen that 
variations in the situation may also influence _ 
one or more of the primary determinants of 
behavior. Clearly, an individual is likely to have 
quite different beliefs about the consequences 
of performing a given behavior in a public 
situation than in a private situation. Similarly, 
we would also expect that the individual has 
different normative beliefs (both personal and 
social) for different situations. Here too, how- 
ever, it must be noted that if the situational 
variables that are being considered do not in- 
fluence the individual's attitude toward the 
act, or his personal or social normative beliefs, 
or his motivation to comply with these beliefs, 
then, according to the theory, they will not 
influence his behavior. Indeed, one of the ad- 
vantages of the theory is that it provides an 
explanation for a large number of results that 
may initially appear inconsistent. That is, from 
the point of view presented here, one would 
expect a considerable amount of variation in 
the relationship between any given variable 
and behavior. 

In addition, it should be recalled that the 
specific weights given to the three basic com- 
ponents [ie., (4,4), (NB, x Mc,), and (NB, X 
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Mc,)] of the theory must be determined; in- 
deed, these weights are expected to vary from 
behavior to behavior. That is, with respect to 
some behaviors, the attitude toward the be- 
havior may be a more important determinant 
of behavior than either type of normative belief 
and the motivation to comply with these norms, 
while for other types of behaviors the weight- 
ings may be reversed. Similarly, while some 
behaviors may be primarily determined by per- 
sonal normative beliefs, other behaviors may 
be more under the control of social normative 
beliefs. 

Furthermore, just as the weightings of the 
three components may vary with the type of 
behavior that is being considered, they may 
also vary for different individuals. That is, for 
some individuals, behavior may be primarily 
determined by norms and motivation to com- 
ply with the norms, while for other individuals 
behavior may be primarily determined by their 
attitudes toward the behavior. For example, 
it is possible that the behavior of high authori- 
tarian individuals may be largely under the 
control of normative beliefs and motivation to 
comply with the norms. 

Thus, even though an “outside” variable 
(e.g., attitude toward a stimulus object) may 
be related to one of the basic determinants of 
behavior (e.g., attitude toward the behavior), 
the outside variable may still be unrelated to 
the actual performance of that behavior. 


SUMMARY 


The present approach clearly indicates that 
traditional measures of attitude (i.e, toward 
a given object, person, or class of people) are 
not likely to be related to behavior in any 
consistent fashion. Indeed, the main purpose 
of the present paper has been to reconsider, 
and seriously question, the basic assumption 
that an individual's attitude toward an object 
is a major determinant of his behavior with 
respect to the object. While other investigators 
have also argued that traditional attitude mea- 
sures will not predict behavior, they have 
usually questioned the measure of attitude 
rather than the assumption of an attitude-behav- 
ior relationship. More specifically, they have 
argued that most traditional measures of atti- 
tude are oversimplified; that most measures 
only consider an individual's “affective feel- 
ings," and fail to take his cognitions and cona- 
tions into account. Thus most investigators 


have attempted to resolve the attitude-behavior 
problem by expanding the definition of atti- 
tude to include affective, cognitive, and conative 
components. 

In contrast to this, the present paper has 
attempted to show that beliefs (cognitions) 
and behavioral intentions (conations) can best 
be viewed as determinants or consequents of 
attitude; that rather than being viewed as parts 
of attitude, these variables should be viewed 
as independent phenomena that are related to, 
and serve as indicants of, an individual's atti- 
tude. Furthermore, it was argued that even if 
an individual's beliefs about an object and/or 
his behavioral intentions toward the object 
are considered, it is not likely that behavioral 
prediction will be improved. First, these vari- 
ables may be highly correlated with traditional 
measures of attitude, and thus they will not 
explain additional variance in behavior. Sec- 
ond, even if this is not the case (ie. even if 
beliefs and behavioral intentions that are 
unrelated to attitude are selected for considera- 
tion), they may still be unrelated to the be- 
havior.f That is, viewing the attitude-behavior 
relationship within the framework of a 
multiattitude object-multimethod approach, it 
becomes clear that the most important deter- 
minants of behavior may be other variables 
than an individual's beliefs about, attitude to- 
ward, or general behavioral intentions toward, 
a given object. Indeed, this approach clearly 
indicates that behavior toward an object may 
be completely determined by situational or 
individual difference variables, rather than any 
variable associated with the stimulus object per 
se. In other words, this approach points out 
that behavior toward a given object is a func- 
tion of many variables, of which attitude toward 
the object is only one. 

Consistent with this position, a theory of 
behavioral prediction was suggested. In con- 
trast to previous attempts to resolve the 
attitude-behavior problem by expanding the 
definition of attitude, the proposed theory 
attempts to understand the attitude-behavior 
relationship by (1) considering a limited set 
of variables that function as primary deter- 
minants of behavior, and (2) analyzing the 
relations between these variables and tradi- 
tional methods of attitude. 


4 It should be recalled that the size of the rela- 
tionship between a behavioral intention and a be- 
havior is primarily a function of the specificity of 
the behavioral intention. 
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